홈
태그
방명록

분류 전체보기 (12)

ABOUT ME

-

트위터
인스타그램

Today: -

Yesterday: -

Total: -

AI 지식창고 AI 지식창고

컨텐츠 검색 블로그 내 검색

Proximal Policy Optimization

Proximal Policy Optimization (PPO) Algorithm
CS & ML Basic 2023. 2. 14. 16:02

The objective of the Proximal Policy Optimization (PPO) algorithm is to train a policy function that can control an agent's behavior in a given environment, such that it maximizes the expected cumulative reward over time. More formally, we can define the objective of PPO as follows: $$J(\theta) = \mathbb{E}{\pi{\theta}}\left[\sum_{t=0}^{\infty}\gamma^{t}r_{t}\right]$$ where \(J(\theta)\) is the ..

이전

1

다음

인기포스트

ABOUT ME

AI 논문 리뷰 요리 그리고 여행 기록.

LINK

ADMIN

admin 글쓰기

Designed by Tistory.

티스토리툴바