Browsed by
[Tag:] AI

Turing Award Goes to 2 Pioneers of Artificial Intelligence

Turing Award Goes to 2 Pioneers of Artificial Intelligence

Andrew Barto and Richard Sutton developed reinforcement learning, a technique vital to chatbots like ChatGPT. In 1977, Andrew Barto, as a researcher at the University of Massachusetts, Amherst, began exploring a new theory that neurons behaved like hedonists. The basic idea was that the human brain was driven by billions of nerve cells that were each trying to maximize pleasure and minimize pain. A year later, he was joined by another young researcher, Richard Sutton. Together, they worked to explain human…

Read More Read More

AI systems based on two model types

AI systems based on two model types

The current frontier of AI systems is based on two model types(they are almost identical under the hood, but their behavior is notably different in practice, hence the distinction) Pre-trained models, also known as ‘non-reasoning models’ These are the famous ‘Large Language Models’, or LLMs, gigantic AI models trained on as much as data as possible, reaching double digits of trillions of words (for reference, Lama 3.1 405B was trained on 15 trillion tokens ~ 11-12.5 trillion words, and DeepSeek…

Read More Read More

How did DeepSeek build its A.I. with less money?

How did DeepSeek build its A.I. with less money?

The Chinese start-up used several technological tricks, including a method called “mixture of experts” to significantly reduce the cost of building the technology. A.I. companies typically train their chatbots using supercomputers packed with 16,000 specialized chips or more. But DeepSeek said it needed only about 2,000. DeepSeek engineers needed only about $6 million in raw computing power, roughly one-tenth of what Meta spent in building its latest A.I. technology. What exactly did DeepSeek do? How are A.I. technologies built? Companies…

Read More Read More

DeepSeek just Exposed the Rot at the Core of the AI Industry

DeepSeek just Exposed the Rot at the Core of the AI Industry

DeepSeek made two critical changes. Firstly. the architecture. OpenAI uses an AI architecture known as “fully dense”. This basically means that the architecture is comprised of a single, vast network that processes every request with all its parameters and data points. This is incredibly computationally dense, but the idea is that it can make it more capable in a broader application. DeepSeek is instead much more picky and uses a “mixture of experts” architecture. In this approach, the AI is…

Read More Read More

What is Distillation in A.I. ?

What is Distillation in A.I. ?

DeepSeek shook up the U.S. stock market, and it’s still creating shock wavers around world. But the newest allegation is that DeepSeek actually used a particular process to put together its training data, and it’s one that some consider to be a little shady. The new U.S. president’s AI and crypto czar David Sacks is one of those who is getting in on the action, saying in an interview with Fox News that there was “substantial evidence” that this kind…

Read More Read More

DeepSeek

DeepSeek

DeepSeek’s breakthrough on cost challenges the “bigger is better” narrative that has driven the A.I. arms race in recent years by showing that relatively small models, when trained properly, can match or exceed the performance of much bigger models. That, in turn, means that A.I. companies may be able to achieve very powerful capabilities with far less investment than previously thought. And it suggests that we may soon see a flood of investment into smaller A.I. start-ups, and much more…

Read More Read More

Stargate

Stargate

OpenAI, Oracle and SoftBank formed a new joint venture called Stargate to invest in data centers, building on major U.S. investments in the technology. On Tuesday(2025.1.21), President Trump announced a joint venture between OpenAI, SoftBank, and Oracle called Stargate, which aims to invest at least $100 billion in U.S. data centers. The group behind the project said it could invest as much as half a trillion dollars in Stargate over the next four years. Elon Musk, who runs a competing…

Read More Read More

OpenAI Details Plan for Becoming a For-Profit Company

OpenAI Details Plan for Becoming a For-Profit Company

OpenAI revealed details on Friday(‘24.12.17) about its plan to adopt a new corporate structure that will remove the company from control by a nonprofit that has been the focus of contention. OpenAI said it planned to restructure as a public benefit corporation, or P.B.C., which is a for-profit corporation designed to crate public and social good. OpenAI rivals like Anthropic and Elon Musk’s xAI use a similar structure. OpenAI’s latest funding round valued the company at $157billion. OpenAI에 대해 짧게…

Read More Read More

Is the Tech Industry Already on the Cusp of an A.I. Slowdown?

Is the Tech Industry Already on the Cusp of an A.I. Slowdown?

Companies like Open AI and Google are running out of the data used to train artificial intelligence systems. Can new methods continue years of rapid progress? Demis Hassabis, one of the most influential artificial intelligence experts in the world, has a warning for the rest of the tech industry: Don’t expect chatbots to continue to improve as quickly as they have over the last few years. A.I. researchers have for some time been relying on a fairly simple concept to…

Read More Read More

박태웅의 AI 강의 2025

박태웅의 AI 강의 2025

P149거대 언어모델은 언어에 대한 좋은 모델이지만, 인간 사고에 대해서는 불완전한 모델이라는 것입니다. 이런 차이 때문에 ‘형식적 언어 능력’이 필요한 과제에서는 거대언어 모델이 인상적인 성과를 보이지만, ‘기능적 능력’이 필요한 많은 테스트에서는 실패한다는 것입니다. 이들은 (1) 현재의 거대 언어모델은 형식적 언어 능력의 모델로서 진지하게 받아들여야 하며 (2) 실제 언어 사용을 마스트하는 모델은 핵심 언어 모델뿐만 아니라 사고 모델링에 필요한 여러 비언어적 인지능력을 통합하거나 추가 개발할 필요가 있다고 주장합니다. 지난 목요일(11.21) 회의차 한국 마이크로소프트를 방문하게 되었는데 조금 일찍 도착한 터라 교보문고에서 책을 좀…

Read More Read More