Browsed by
Category: Study

Studying English for Forest Interpretation

What does a software architect do?

A software architect is a programmer, and continues to be a programmer. Never fall for the lie that suggests that software architects pull back from code to focus on higher-level issues. They do not! Software architects are the best programmers, and they continue to take programming tasks, while they also guide the rest of the team toward a design that maximizes productivity. Software architects may not write as much code as other programmers do, but they continue to engage in…

Turing Award Goes to 2 Pioneers of Artificial Intelligence

Andrew Barto and Richard Sutton developed reinforcement learning, a technique vital to chatbots like ChatGPT. In 1977, Andrew Barto, as a researcher at the University of Massachusetts, Amherst, began exploring a new theory that neurons behaved like hedonists. The basic idea was that the human brain was driven by billions of nerve cells that were each trying to maximize pleasure and minimize pain. A year later, he was joined by another young researcher, Richard Sutton. Together, they worked to explain human…

Spring Boot Caching with Redis

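Lately I have been trying to get comfortable with how Spring is used today. In truth, I used Spring only briefly about 15 years ago, in the early days of the Spring Framework, and have had no experience with it since, so although I can talk about the concepts, I am in no position to trade notes with today's developers about using Spring. I am taking this opportunity to pull out the core technologies built into Spring one at a time, understand them, and write them up. This post is the first in that series and covers Spring Cache. It goes a little deeper than the material commonly found on the internet and looks at the mechanism by which caching is applied and operates…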

Nvidia and DeepSeek

DeepSeek challenged a tech industry consensus that to build bigger and better A.I. systems, companies would have to build bigger and more powerful data centers. It set off fears that companies might pull back on their spending with Nvidia. Since then, a new consensus has emerged that Nvidia will continue to benefit because it will become affordable for more companies to develop A.I. systems. An expanded field of A.I. business would create more customers for Nvidia’s expensive chips, not fewer, as…

AI systems based on two model types

The current frontier of AI systems is based on two model types (they are almost identical under the hood, but their behavior is notably different in practice, hence the distinction). The first is pre-trained models, also known as ‘non-reasoning models’. These are the famous ‘Large Language Models’, or LLMs, gigantic AI models trained on as much data as possible, reaching double digits of trillions of words (for reference, Llama 3.1 405B was trained on 15 trillion tokens ~ 11-12.5 trillion words, and DeepSeek…
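
The token and word figures quoted above imply a words-per-token ratio, which the short Python check below makes explicit. It is only arithmetic on the numbers in the excerpt. The variable names are illustrative, and the ratio is a rough rule of thumb that depends on the tokenizer and the text.

```python
# Figures taken from the excerpt above.
tokens = 15e12        # 15 trillion tokens
words_low = 11e12     # lower word estimate
words_high = 12.5e12  # upper word estimate

# Implied words-per-token ratio (illustrative names, not from the source).
ratio_low = words_low / tokens
ratio_high = words_high / tokens

print(f"{ratio_low:.2f} to {ratio_high:.2f} words per token")
# prints "0.73 to 0.83 words per token"
```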

How did DeepSeek build its A.I. with less money?

The Chinese start-up used several technological tricks, including a method called “mixture of experts” to significantly reduce the cost of building the technology. A.I. companies typically train their chatbots using supercomputers packed with 16,000 specialized chips or more. But DeepSeek said it needed only about 2,000. DeepSeek engineers needed only about $6 million in raw computing power, roughly one-tenth of what Meta spent in building its latest A.I. technology. What exactly did DeepSeek do? How are A.I. technologies built? Companies…

DeepSeek just Exposed the Rot at the Core of the AI Industry

DeepSeek made two critical changes. Firstly, the architecture. OpenAI uses an AI architecture known as “fully dense”. This basically means that the architecture is composed of a single, vast network that processes every request with all its parameters and data points. This is incredibly computationally expensive, but the idea is that it can make the model more capable across a broader range of applications. DeepSeek is instead much more picky and uses a “mixture of experts” architecture. In this approach, the AI is…
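
The contrast between a dense network and a "mixture of experts" can be sketched in a few lines of Python. This is a toy under stated assumptions, not DeepSeek's or OpenAI's actual architecture. The names `make_expert`, `moe_forward`, `router`, and `top_k`, the sizes, and the softmax gating over the selected experts are all hypothetical choices made for illustration. The point it shows is that a router picks a few experts per input, so only a fraction of the expert parameters do any work on a given request.

```python
import math
import random


def make_expert(rng, dim):
    """Toy 'expert': a single dim x dim weight matrix (hypothetical stand-in)."""
    return [[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(dim)]


def apply_expert(weights, x):
    """Matrix-vector product: the expert's output for input x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in weights]


def softmax(scores):
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def moe_forward(x, experts, router, top_k):
    """Route x to the top_k highest-scoring experts and mix their outputs."""
    # One router score per expert.
    scores = [sum(w * xi for w, xi in zip(row, x)) for row in router]
    # Keep only the indices of the top_k experts.
    chosen = sorted(range(len(experts)), key=lambda i: scores[i], reverse=True)[:top_k]
    # Gate weights are a softmax over the chosen experts' scores.
    gates = softmax([scores[i] for i in chosen])
    out = [0.0] * len(x)
    for gate, i in zip(gates, chosen):
        y = apply_expert(experts[i], x)
        out = [o + gate * yi for o, yi in zip(out, y)]
    return out, chosen


rng = random.Random(0)
dim, num_experts, top_k = 4, 8, 2  # illustrative sizes only
experts = [make_expert(rng, dim) for _ in range(num_experts)]
router = [[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(num_experts)]
x = [0.5, -1.0, 0.25, 2.0]

out, chosen = moe_forward(x, experts, router, top_k)
active_fraction = top_k / num_experts
print(f"experts used: {len(chosen)} of {num_experts} ({active_fraction:.0%} of expert parameters)")
# prints "experts used: 2 of 8 (25% of expert parameters)"
```

Setting `top_k` equal to `num_experts` makes every expert run on every input, which is the closest this toy comes to the "fully dense" case described above.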

What is Distillation in A.I.?

DeepSeek shook up the U.S. stock market, and it’s still creating shock waves around the world. But the newest allegation is that DeepSeek actually used a particular process to put together its training data, and it’s one that some consider to be a little shady. The new U.S. president’s AI and crypto czar David Sacks is one of those who is getting in on the action, saying in an interview with Fox News that there was “substantial evidence” that this kind…

DeepSeek

DeepSeek’s breakthrough on cost challenges the “bigger is better” narrative that has driven the A.I. arms race in recent years by showing that relatively small models, when trained properly, can match or exceed the performance of much bigger models. That, in turn, means that A.I. companies may be able to achieve very powerful capabilities with far less investment than previously thought. And it suggests that we may soon see a flood of investment into smaller A.I. start-ups, and much more…

Stargate

OpenAI, Oracle and SoftBank formed a new joint venture called Stargate to invest in data centers, building on major U.S. investments in the technology. On Tuesday (2025.1.21), President Trump announced a joint venture between OpenAI, SoftBank, and Oracle called Stargate, which aims to invest at least $100 billion in U.S. data centers. The group behind the project said it could invest as much as half a trillion dollars in Stargate over the next four years. Elon Musk, who runs a competing…
