Browsed by
Month: February 2025

Nvidia and DeepSeek

DeepSeek challenged a tech industry consensus that to build bigger and better A.I. systems, companies would have to build bigger and more powerful data centers. It set off fears that companies might pull back on their spending with Nvidia. Since then, a new consensus has emerged that Nvidia will continue to benefit because it will become affordable for more companies to develop A.I. systems. An expanded field of A.I. businesses would create more customers for Nvidia’s expensive chips, not fewer, as…

Read More

About KFC

Harland Sanders, known universally as “Colonel Sanders,” started selling chicken from a roadside motel in Kentucky in 1930. He prepared the meat in a pressure cooker, which sealed in the flavor. The secret chicken recipe, including its famous 11 herbs and spices, was established in 1939, and the colonel began wearing his trademark white suit in 1950. The first Kentucky Fried Chicken franchise opened in 1952 near Salt Lake City. After selling the company in 1964, Mr. Sanders – his…

Read More

AI systems based on two model types

The current frontier of AI systems is based on two model types (they are almost identical under the hood, but their behavior is notably different in practice, hence the distinction). Pre-trained models, also known as ‘non-reasoning models’: these are the famous ‘Large Language Models’, or LLMs, gigantic AI models trained on as much data as possible, reaching into the tens of trillions of words (for reference, Llama 3.1 405B was trained on 15 trillion tokens, or roughly 11-12.5 trillion words, and DeepSeek…

Read More
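
The token-to-word conversion quoted in the excerpt above can be checked with a few lines of arithmetic. This is only a rough sketch: the three figures come from the excerpt, while the function name `words_per_token` is made up for illustration, and the real ratio depends on the tokenizer and the text being tokenized.

```python
# Figures quoted in the excerpt for Llama 3.1 405B.
TOKENS = 15e12        # 15 trillion training tokens
WORDS_LOW = 11e12     # lower estimate: 11 trillion words
WORDS_HIGH = 12.5e12  # upper estimate: 12.5 trillion words


def words_per_token(words, tokens):
    """Return the average number of words represented by one token."""
    return words / tokens


# Hypothetical helper names; the ratios are implied by the quoted range.
low_ratio = words_per_token(WORDS_LOW, TOKENS)
high_ratio = words_per_token(WORDS_HIGH, TOKENS)

print(f"{low_ratio:.2f} to {high_ratio:.2f} words per token")
# prints "0.73 to 0.83 words per token"
```

In other words, the quoted range assumes that one token covers a bit less than one word on average.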

How did DeepSeek build its A.I. with less money?

The Chinese start-up used several technological tricks, including a method called “mixture of experts” to significantly reduce the cost of building the technology. A.I. companies typically train their chatbots using supercomputers packed with 16,000 specialized chips or more. But DeepSeek said it needed only about 2,000. DeepSeek engineers needed only about $6 million in raw computing power, roughly one-tenth of what Meta spent in building its latest A.I. technology. What exactly did DeepSeek do? How are A.I. technologies built? Companies…

Read More

DeepSeek just Exposed the Rot at the Core of the AI Industry

DeepSeek made two critical changes. First, the architecture. OpenAI uses an AI architecture known as “fully dense”. This basically means that the architecture consists of a single, vast network that processes every request with all of its parameters. This is incredibly computationally intensive, but the idea is that it makes the model more capable across a broader range of applications. DeepSeek is instead much more picky and uses a “mixture of experts” architecture. In this approach, the AI is…

Read More
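
To make the contrast between "fully dense" and "mixture of experts" more concrete, here is a toy sketch in plain Python. It is not DeepSeek's or OpenAI's implementation, and every name in it (`make_expert`, `dense_forward`, `moe_forward`, the router weights, the sizes) is an assumption chosen for illustration. The only idea it tries to show is that a router scores the experts for each input and only the top few actually run, whereas the dense baseline runs all of them.

```python
import math
import random


def softmax(scores):
    """Turn raw scores into positive weights that sum to 1."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def make_expert(rng, dim):
    """Hypothetical expert: one dim x dim linear layer with random weights."""
    return [[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(dim)]


def apply_expert(weights, x):
    """Matrix-vector product: run one expert on the input vector."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in weights]


def dense_forward(experts, x):
    """Dense baseline: every expert processes every input; outputs are averaged."""
    outputs = [apply_expert(w, x) for w in experts]
    return [sum(col) / len(experts) for col in zip(*outputs)]


def moe_forward(experts, router, x, top_k=2):
    """Mixture of experts: run only the top_k experts the router prefers."""
    # One router score per expert for this particular input.
    scores = [sum(r * xi for r, xi in zip(row, x)) for row in router]
    chosen = sorted(range(len(experts)), key=lambda i: scores[i], reverse=True)[:top_k]
    # Gate weights are normalized over the chosen experts only.
    gates = softmax([scores[i] for i in chosen])
    out = [0.0] * len(x)
    for gate, i in zip(gates, chosen):
        y = apply_expert(experts[i], x)
        out = [o + gate * yi for o, yi in zip(out, y)]
    return out, chosen


rng = random.Random(0)
DIM, NUM_EXPERTS = 4, 8  # toy sizes, assumed for illustration
experts = [make_expert(rng, DIM) for _ in range(NUM_EXPERTS)]
router = [[rng.uniform(-1, 1) for _ in range(DIM)] for _ in range(NUM_EXPERTS)]
x = [0.5, -1.0, 0.25, 2.0]

_, used = moe_forward(experts, router, x, top_k=2)
print(f"dense runs {NUM_EXPERTS} experts; this MoE call ran {len(used)}: {used}")
```

With 8 experts and `top_k=2`, each input touches a quarter of the expert weights, which is the source of the compute savings the excerpt alludes to. Real systems add details this sketch leaves out, such as training the router and balancing load across experts.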