Browsed by
[Category:] 공부

숲 해설을 위한 영어 공부

AI systems based on two model types

AI systems based on two model types

The current frontier of AI systems is based on two model types(they are almost identical under the hood, but their behavior is notably different in practice, hence the distinction) Pre-trained models, also known as ‘non-reasoning models’ These are the famous ‘Large Language Models’, or LLMs, gigantic AI models trained on as much as data as possible, reaching double digits of trillions of words (for reference, Lama 3.1 405B was trained on 15 trillion tokens ~ 11-12.5 trillion words, and DeepSeek…

Read More Read More

How did DeepSeek build its A.I. with less money?

How did DeepSeek build its A.I. with less money?

The Chinese start-up used several technological tricks, including a method called “mixture of experts” to significantly reduce the cost of building the technology. A.I. companies typically train their chatbots using supercomputers packed with 16,000 specialized chips or more. But DeepSeek said it needed only about 2,000. DeepSeek engineers needed only about $6 million in raw computing power, roughly one-tenth of what Meta spent in building its latest A.I. technology. What exactly did DeepSeek do? How are A.I. technologies built? Companies…

Read More Read More

DeepSeek just Exposed the Rot at the Core of the AI Industry

DeepSeek just Exposed the Rot at the Core of the AI Industry

DeepSeek made two critical changes. Firstly. the architecture. OpenAI uses an AI architecture known as “fully dense”. This basically means that the architecture is comprised of a single, vast network that processes every request with all its parameters and data points. This is incredibly computationally dense, but the idea is that it can make it more capable in a broader application. DeepSeek is instead much more picky and uses a “mixture of experts” architecture. In this approach, the AI is…

Read More Read More

What is Distillation in A.I. ?

What is Distillation in A.I. ?

DeepSeek shook up the U.S. stock market, and it’s still creating shock wavers around world. But the newest allegation is that DeepSeek actually used a particular process to put together its training data, and it’s one that some consider to be a little shady. The new U.S. president’s AI and crypto czar David Sacks is one of those who is getting in on the action, saying in an interview with Fox News that there was “substantial evidence” that this kind…

Read More Read More

DeepSeek

DeepSeek

DeepSeek’s breakthrough on cost challenges the “bigger is better” narrative that has driven the A.I. arms race in recent years by showing that relatively small models, when trained properly, can match or exceed the performance of much bigger models. That, in turn, means that A.I. companies may be able to achieve very powerful capabilities with far less investment than previously thought. And it suggests that we may soon see a flood of investment into smaller A.I. start-ups, and much more…

Read More Read More

Stargate

Stargate

OpenAI, Oracle and SoftBank formed a new joint venture called Stargate to invest in data centers, building on major U.S. investments in the technology. On Tuesday(2025.1.21), President Trump announced a joint venture between OpenAI, SoftBank, and Oracle called Stargate, which aims to invest at least $100 billion in U.S. data centers. The group behind the project said it could invest as much as half a trillion dollars in Stargate over the next four years. Elon Musk, who runs a competing…

Read More Read More

OpenAI Details Plan for Becoming a For-Profit Company

OpenAI Details Plan for Becoming a For-Profit Company

OpenAI revealed details on Friday(‘24.12.17) about its plan to adopt a new corporate structure that will remove the company from control by a nonprofit that has been the focus of contention. OpenAI said it planned to restructure as a public benefit corporation, or P.B.C., which is a for-profit corporation designed to crate public and social good. OpenAI rivals like Anthropic and Elon Musk’s xAI use a similar structure. OpenAI’s latest funding round valued the company at $157billion. OpenAI에 대해 짧게…

Read More Read More

A Story about Building Container Images

A Story about Building Container Images

최근 들어 Docker Container에 대해 깊은 이해가 필요하여 이 참에 어렴풋이 알고 있던 Container에 대해 제대로 함 알아보자는 요량으로 관련 서적을 뒤지다가 가장 최근의 책이라고 여겨지는 “Docker Deep Dive 2024 edition” 을 골라 습독을 했다. 대략 Docker Container에 대한 내용은 익히 알고 있던터라 개념적 이해는 쉽게 지나갔지만 실제 이 기술을 이용하고 싶을 때 막히지 않을 정도로 몸에 익혀보자는 생각으로 책에 나온 샘플 코드를 이리 저리 내가 활용해 보고 싶은 시나리오로 변형하고 적용하다 보니 계획했던 2주 보다 시간이 좀 더 걸렸다….

Read More Read More

Is the Tech Industry Already on the Cusp of an A.I. Slowdown?

Is the Tech Industry Already on the Cusp of an A.I. Slowdown?

Companies like Open AI and Google are running out of the data used to train artificial intelligence systems. Can new methods continue years of rapid progress? Demis Hassabis, one of the most influential artificial intelligence experts in the world, has a warning for the rest of the tech industry: Don’t expect chatbots to continue to improve as quickly as they have over the last few years. A.I. researchers have for some time been relying on a fairly simple concept to…

Read More Read More

Docker Volume 알아채기

Docker Volume 알아채기

Docker Volume을 create, backup, migrate, restore하는 일련의 예제를 작성해 봄으로써 Docker를 활용할 때 데이터를 영구적으로 저장/유지하는 방법에 대해 알아본다. Docker가 무엇인지 먼저 간략하게 알아보기 Docker는 어플리케이션을 개발(developing), 배포(shipping), 실행(running)하는 하나의 오픈 플랫폼(Open platform)이라 할 수 있다. 또한 어플리케이션을 운영 인프라로부터 개념적으로 분리시켜 줌으로써 보다 빠르게 소프트웨어를 배포할 수 있도록 한다. Docker를 활용하면 응용 프로그램을 관리하는 것과 같은 방식으로 앱의 인프라(infrastructure)를 관리할 수 있어 개발자에게 전체 개발 프로세스를 상당히 간소화 시켜준다. 코드를 배포, 테스트, 배포를 빠르게 해주는 Docker 방법론을 활용하면 개발자는…

Read More Read More