본문 바로가기

전체 글

(13)

Bellman Equation and Reward Calculation in Grid world in 30 Seconds 1. In deterministics case, 2. In stochastic case, We met system error that In order to calculate the value of square, you need to know value of surrounding squares, vice versa RL's goalCalculate the value of each state: PredictionControl problem that finding optimal policy Dynamic programming algorithm to solve Reinforcement Learning problem1.Policy IterationIt starts estimating value for ea..

def f for loop in dfs bfs

Cross Product in 7D \documentclass{article} \usepackage{amsmath} \usepackage{amsfonts} \usepackage{amssymb} \begin{document} \section*{Cross Product in 3D} The cross product of two vectors $\mathbf{a} = (a_1, a_2, a_3)$ and $\mathbf{b} = (b_1, b_2, b_3)$ in 3D is defined as: \[ \mathbf{a} \times \mathbf{b} = \begin{vmatrix} \mathbf{i} & \mathbf{j} & \mathbf{k} \\ a_1 & a_2 & a_3 \\ b_1 & b_2 & b_3 \end{vmatrix} = \..

목록 더보기

티스토리툴바