๐Ÿฌ ML & Data/๐Ÿ“ฎ Reinforcement Learning

[MPC] 1. Model Predictive Control Intro

darly213 2024. 3. 6. 16:01
728x90

์œ ํŠœ๋ธŒ https://www.youtube.com/watch?v=zU9DxmNZ1ng&list=PLSAJDR2d_AUtkWiO_U-p-4VpnXGIorrO-&index=1
๋ธ”๋กœ๊ทธ https://sunggoo.tistory.com/65

์œ„ ์ž๋ฃŒ๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ๊ณต๋ถ€ํ•œ ๋‚ด์šฉ์„ ๊ฐ€๋ณ๊ฒŒ ์ •๋ฆฌํ•˜๋ ค๊ณ  ํ•ฉ๋‹ˆ๋‹ค. ์ˆ˜์‹ ์ฆ๋ช…์ด ๋งŽ๊ฒ ๊ณ , ๊ทธ ๋’ค๋กœ๋Š” ๋ชฉ์ ์— ๋”ฐ๋ผ ๋…ผ๋ฌธ์ด๋‚˜ ์ฝ”๋“œ ๊ตฌํ˜„์„ ๋ณด๋ฉด์„œ ์ถ”๊ฐ€ํ•ด๋ณด๊ฒ ์Šต๋‹ˆ๋‹ค. 

MPC(Model Predictive Control)์˜ ์ปจ์…‰

  • ๊ธฐ๊ธฐ ์ƒํƒœ ๋ณ€ํ™”(dynamics) + ์ฃผ๋ณ€ ํ™˜๊ฒฝ ์š”์†Œ => cost function
  • ์ œ์–ด๊ณตํ•™
  • ๋น„์„ ํ˜• / ๋น„๋ณผ๋ก(Non-linear, Non-convex) ๋Œ€์ƒ
  • ๊ณต๋ถ€ํ•˜๋ฉด์„œ ๋А๋ผ๊ธฐ์—๋Š” ๊ฐ•ํ™”ํ•™์Šต์˜ ํ–ฅ๊ธฐ๊ฐ€ ์ข€ ์žˆ์Œ

 

Flow

์ถœ์ฒ˜: https://www.ist.uni-stuttgart.de/research/group-of-frank-allgoewer/model-predictive-control/

  • k-1 ์ผ ๋•Œ์˜ ์ƒํƒœ ๋ณ€์ˆ˜๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ k+1 ~ k+n๊นŒ์ง€์˜ ๋ฏธ๋ž˜ ์ œ์–ด ๋ช…๋ น์„ ๋ฏธ๋ฆฌ ๊ณ„ํšํ•จ
  • ๊ณ„ํšํ•œ ์ œ์–ด ๋ช…๋ น ์ถœ๋ ฅ
  • k์˜ ์ƒํƒœ ๋ณ€์ˆ˜ feedback
  • ์ œ์–ด ๋ช…๋ น์˜ ์ถœ๋ ฅ๊ณผ response๋ฅผ ๋น„๊ตํ•ด์„œ ํ‰๊ฐ€
  • cost funciton update
  • ๋‹ค์‹œ ๋ฏธ๋ž˜ ์ œ์–ด ๋ช…๋ น ๊ณ„ํš
728x90
๋Œ“๊ธ€์ˆ˜0