Newsletter

DeepSeek: A New Era of AI Brinkmanship





DeepSeek, a Chinese artificial intelligence company founded in 2023 by former hedge fund manager Liang Wenfeng, has rapidly emerged as a formidable challenger in the global AI race. Backed by Wenfeng's quantitative hedge fund High Flyer, this newcomer has achieved something remarkable: developing large language models (LLMs) that rival industry giants like OpenAI and Google, while reportedly using significantly less resources. This dramatic rise has sparked intense discussions about a potential new era in the US-China AI competition landscape.

DeepSeek's Innovative Approach to AI Development

Despite facing U.S. sanctions limiting access to advanced AI chips, DeepSeek employed creative optimization strategies to overcome these constraints. Their success stems from several innovative approaches:

Architectural and Training Innovations

DeepSeek implemented two particularly notable optimizations in their development process:

  • Hardware-level programming: Their team created specialized low-level programming solutions to address bandwidth limitations in their hardware.
  • Self-improvement reinforcement learning: They developed a reinforcement learning approach that minimizes human intervention, allowing their models to improve iteratively through their own learning.

The company utilizes a "mixture of experts" architecture instead of relying on a single monolithic model. This approach employs specialized models for different tasks, significantly improving both efficiency and performance.

Training Process and Infrastructure

DeepSeek's training methodology follows a three-stage approach:

  1. Pretraining: Using 1.8 trillion tokens (87% source code, 10% code-related English, 3% Chinese content)
  2. Long-context pretraining: 200 billion tokens, extending context length from 4K to 16K tokens
  3. Supervised fine-tuning: 2 billion tokens of instruction data

To support this ambitious training regimen, DeepSeek built proprietary computing clusters. Their first cluster, Fire-Flyer (萤火一号), was constructed between 2019 and 2020 at a cost of 200 million yuan. It featured 1,100 GPUs interconnected at 200 Gbps, though it was retired after 18 months of operation.

Market Entry and Rapid Growth

DeepSeek's first free chatbot app, released in January 2025 for iOS and Android, became an immediate sensation. Within weeks, it claimed the top spot as the most downloaded free app on the iOS App Store in the United States, surpassing even ChatGPT. This explosive growth reportedly contributed to an 18% drop in Nvidia's share price, highlighting the market's reaction to this disruptive force.

DeepSeek's Product Ecosystem

DeepSeek has rapidly developed a comprehensive ecosystem to deliver its AI capabilities across different platforms:

  • Mobile applications: Free chatbot apps for iOS and Android devices
  • Browser extension: The DeepSeek Chrome extension offers advanced research capabilities, multi-source analysis, and visual data interpretation
  • Web interface: Direct access through DeepSeek's website
  • Developer API: Compatible with OpenAI's API format for seamless integration into existing applications
  • Cloud platforms: Available through Azure AI Foundry and GitHub for developer access

These products support various applications, including browser-based user agents that can perform web searches and complex tasks like flight bookings, as well as web scraping capabilities when integrated with tools like Grok.

For users concerned about data security and privacy, DeepSeek offers multiple access options:

  • Through US-based providers like Perplexity that run the model in their own data centers
  • By downloading the open-source models from Hugging Face to run locally on personal hardware

DeepSeek vs. Established Competitors

DeepSeek's rapid emergence has intensified competition across the AI landscape, particularly between the U.S. and China. Some analysts view this as a potential "Sputnik moment" for AI development, while others see it as an overreaction.

DeepSeek vs. OpenAI

In response to DeepSeek's competitive challenge, OpenAI has:

  • Formed strategic partnerships, like its collaboration with Kakao in South Korea
  • Launched enhanced features such as "deep research" for ChatGPT Pro users
  • Emphasized its continued advantages in reasoning capabilities and accuracy

DeepSeek vs. Google Gemini

Google has responded to DeepSeek's market entry by focusing on:

  • Multimodal capabilities across text, images, and audio
  • Improved processing speed and efficiency
  • Offering a free tier to maintain accessibility

In direct comparisons, Gemini has shown advantages in creative writing and code generation, while DeepSeek has demonstrated strengths in data structuring and providing detailed responses.

DeepSeek vs. Meta's LLaMA

Meta's open-source LLaMA model provides another interesting comparison point:

  • DeepSeek's mixture-of-experts architecture delivers superior performance on complex tasks
  • LLaMA maintains advantages in lightweight applications and edge device deployment
  • Both models follow open-source approaches, though with different architectural philosophies

Controversies and Challenges

DeepSeek's rapid rise hasn't been without significant controversies:

Security and Privacy Concerns

Multiple government entities have taken action regarding DeepSeek:

  • Taiwan and Texas have banned the application on government devices
  • The US Navy has blocked access to DeepSeek services
  • These restrictions stem from concerns about user data collection and potential data access by the Chinese government

Technical Vulnerabilities

Research has uncovered potential security weaknesses:

  • DeepSeek was targeted by DDoS attacks including NTP and SSDP reflection attacks
  • Researchers found DeepSeek highly susceptible to jailbreak techniques, with one test showing a 100% attack success rate compared to 26% for OpenAI's o1 model

Ethical and Development Controversies

Several ethical questions have surrounded DeepSeek's development:

  • Some experts have questioned DeepSeek's claims about low-cost development, with evidence suggesting potential data exfiltration from OpenAI models
  • White House AI advisor David Sacks has raised intellectual property theft concerns
  • DeepSeek's responses have been found to censor topics sensitive to the Chinese government
  • The open-source approach raises dual-use concerns about potential misuse for malicious purposes

The Future of AI Development

DeepSeek's emergence occurs within a rapidly evolving AI landscape that includes several important trends:

  • Generative AI: Creating original content across text, images, and code
  • Explainable AI (XAI): Making AI decision processes more transparent
  • Edge AI: Deploying models on local devices for faster processing and enhanced privacy

The UK government has announced a new AI Code of Practice aimed at securing AI systems against hacking and sabotage, partly in response to models like DeepSeek. Meanwhile, researchers at UC Berkeley have created TinyZero, a limited replica of DeepSeek R1 for just $30, highlighting how AI development costs might continue to decrease.

Conclusion: A Shifting AI Landscape

DeepSeek's meteoric rise represents a potential paradigm shift in AI development. Its competitive models, developed at what appears to be a fraction of its rivals' cost and computing power, have intensified global competition in artificial intelligence.

While DeepSeek's open-source approach has been praised for democratizing AI access and fostering collaboration, it simultaneously raises serious concerns about security, data privacy, censorship practices, and potential misuse. As AI capabilities continue advancing rapidly, the industry, governments, and society must carefully balance innovation with responsible deployment.

The ongoing evolution of AI technologies like DeepSeek promises to transform industries and daily life, making this a pivotal moment in the development of artificial intelligence. The coming months and years will reveal whether DeepSeek maintains its momentum and how established players adapt to this new competitive landscape.



References

1. DeepSeek is the newest front in the AI competition between the US and China, accessed February 4, 2025, https://www.foxbusiness.com/technology/deepseek-newest-front-ai-competition-between-us-china

2. DeepSeek - Wikipedia, accessed February 4, 2025, https://en.wikipedia.org/wiki/DeepSeek

3. DeepSeek’s ‘open AI’ should terrify Sam Altman, accessed February 4, 2025, https://www.taipeitimes.com/News/editorials/archives/2025/02/05/2003831334

4. DeepSeek: Making Sense of the Reaction—and Overreaction, accessed February 4, 2025, https://www.cfr.org/article/deepseek-making-sense-reaction-and-overreaction

5. WashU Expert: How DeepSeek changes the AI industry - The Source, accessed February 4, 2025, https://source.washu.edu/2025/02/washu-expert-how-deepseek-changes-the-ai-industry/

6. DeepSeek Explained: What Is It and Is It Safe To Use? | News - AI@ND, accessed February 4, 2025, https://ai.nd.edu/news/deepseek-explained-what-is-it-and-is-it-safe-to-use/

7. Is Nvidia in Deep Trouble Due to DeepSeek? - The Motley Fool, accessed February 4, 2025, https://www.fool.com/investing/2025/02/03/is-nvidia-in-deep-trouble-with-deepseek/

8. DeepSeek - Chrome Web Store, accessed February 4, 2025, https://chromewebstore.google.com/detail/deepseek/inhcgfpbfdjbjogdfjbclgolkmhnooop

9. DeepSeek API Docs: Your First API Call, accessed February 4, 2025, https://api-docs.deepseek.com/

10. DeepSeek R1 is now available on Azure AI Foundry and GitHub | Microsoft Azure Blog, accessed February 4, 2025, https://azure.microsoft.com/en-us/blog/deepseek-r1-is-now-available-on-azure-ai-foundry-and-github/

11. Build a Browser Use Agent with DeepSeek: A Step-by-Step Guide - DEV Community, accessed February 4, 2025, https://dev.to/nodeshiftcloud/build-a-browser-use-agent-with-deepseek-a-step-by-step-guide-2n59

12. Scrape Any Website for FREE Using DeepSeek & Crawl4AI - YouTube, accessed February 4, 2025, https://www.youtube.com/watch?v=Osl4NgAXvRk

13. Sam Altman of OpenAI partners with South Korea’s Kakao after DeepSeek scare, accessed February 4, 2025, https://americanbazaaronline.com/2025/02/04/sam-altman-of-openai-partners-with-south-koreas-kakao-after-deepseek-scare459086/

14. OpenAI chief Altman inks deal with S Korea’s Kakao after DeepSeek upset, accessed February 4, 2025, https://www.aljazeera.com/economy/2025/2/4/openai-chief-altman-inks-deal-with-s-koreas-kakao-after-deepseek-upset

15. OpenAI Responds to DeepSeek Hype with ‘Deep Research’ ChatGPT Agent, accessed February 4, 2025, https://decrypt.co/304102/openai-responds-to-deepseek-hype-with-deep-research-chatgpt-agent

16. OpenAI’s ‘deep research’ Might Just Outthink Google and DeepSeek, accessed February 4, 2025, https://analyticsindiamag.com/global-tech/openais-deep-research-might-just-out-think-google-and-deepseek/

17. DeepSeek vs. ChatGPT vs. Google Gemini: How China's AI challenger compares to US rivals | - The Times of India, accessed February 4, 2025, https://timesofindia.indiatimes.com/technology/tech-news/deepseek-vs-chatgpt-vs-google-gemini-how-chinas-ai-challenger-compares-to-us-rivals/articleshow/117786839.cms

18. DeepSeek-V3 vs Gemini 2.0 Flash (Experimental) - Detailed Performance & Feature Comparison - DocsBot AI, accessed February 4, 2025, https://docsbot.ai/models/compare/deepseek-v3/gemini-2-0-flash

19. I tested DeepSeek vs Gemini AI with 7 prompts — here's the winner - Tom's Guide, accessed February 4, 2025, https://www.tomsguide.com/ai/i-tested-deepseek-vs-gemini-ai-with-7-prompts-heres-the-winner

20. I tried out DeepSeek, but I'm sticking with Gemini for now - Android Authority, accessed February 4, 2025, https://www.androidauthority.com/deepseek-vs-gemini-3521178/

21. DeepSeek vs Llama vs GPT-4 | Open-Source AI Models Compared - Civo.com, accessed February 4, 2025, https://www.civo.com/blog/deepseek-vs-llama-vs-gpt4-ai-models

22. Demystifying Deepseek AI, LLaMA and OpenAI: | by Anil Prasad | Jan, 2025 | Medium, accessed February 4, 2025, https://medium.com/@anilAmbharii/demystifying-deepseek-ai-llama-and-openai-8d28c7857bda

23. DeepSeek-V3 vs Llama 3.3 70B Instruct - Detailed Performance & Feature Comparison, accessed February 4, 2025, https://docsbot.ai/models/compare/deepseek-v3/llama-3-3-70b-instruct

24. Llama 3.2 3B vs DeepSeek V3: Comparing Efficiency and Performance | by Novita AI, accessed February 4, 2025, https://medium.com/@marketing_novita.ai/llama-3-2-3b-vs-deepseek-v3-comparing-efficiency-and-performance-7302eee11999

25. Taiwan Bans DeepSeek AI Over National Security Concerns, Citing Data Leakage Risks, accessed February 4, 2025, https://thehackernews.com/2025/02/taiwan-bans-deepseek-ai-over-national.html

26. China's DeepSeek ban starts in America with the State that an 'angry' Elon Musk moved two of his biggest companies' headquarters to after 'dumping' California - The Times of India, accessed February 4, 2025, https://timesofindia.indiatimes.com/technology/tech-news/chinas-deepseek-ban-starts-in-america-with-the-state-that-an-angry-elon-musk-moved-two-of-his-biggest-companies-headquarters-to-after-dumping-california/articleshow/117905369.cms

27. DeepSeek shakes up the AI competition with R1 - Hispanic Engineer & Information Technology, accessed February 4, 2025, https://hispanicengineer.com/manage-new/deepseek-shakes-up-the-ai-competition-with-r1/

28. DeepSeek Compared to ChatGPT, Gemini in AI Jailbreak Test, accessed February 4, 2025, https://www.securityweek.com/deepseek-compared-to-chatgpt-gemini-in-ai-jailbreak-test/

29. Chinese tech startup DeepSeek's chatbot sparks discussion about AI competition - PBS, accessed February 4, 2025, https://www.pbs.org/newshour/world/chinese-tech-startup-deepseeks-chatbot-sparks-discussion-about-ai-competition

30. DeepSeek AI raises national security concerns, U.S. officials say - CBS News, accessed February 4, 2025, https://www.cbsnews.com/news/deepseek-ai-raises-national-security-concerns-trump/

31. VERSES® Genius™ Outperforms DeepSeek R1 Model in Code-Breaking “Mastermind” Challenge | Morningstar, accessed February 4, 2025, https://www.morningstar.com/news/globe-newswire/9352709/verses-genius-outperforms-deepseek-r1-model-in-code-breaking-mastermind-challenge

32. DeepSeek AI replicated for just $30 using Countdown game - The Independent, accessed February 4, 2025, https://www.independent.co.uk/tech/ai-deepseek-b2691112.html

33. 7 Emerging AI Technologies to Watch in 2025 - Brilworks, accessed February 4, 2025, https://www.brilworks.com/blog/emerging-ai-technologies/

34. The profundity of DeepSeek's challenge to America - Asia Times, accessed February 4, 2025, https://asiatimes.com/2025/02/the-profundity-of-deepseeks-challenge-to-america/


Comments