no image
6. Mobile
1. Wired core network는 wired로 이루어져 있음 2. Wireless Host laptop, smartphone.. Base station Wifi AP, Cellular AP.. wired network에 연결되어 있을 수 있음 wired wireless packet 전환 가능 Link multiple access protocol various data rates, transmission distance Infrastructure mode base station handoff - mobile 이동하면서 base station을 변경 Ad hoc mode no base station link로 연결된 node까지만 data 전송 가능 2.1. Wireless Link 특징 Decrea..
2023.05.01
5. Link layer
1. Link layer wired links, wireless links, LANs different link protocol over different link NIC (network interface card)를 통해 link layer 접속 Framing, link access MAC address가 header에 추가되어 destination에서 사용됨 Reliable delivery between adjacent nodes wired link에서는 bit-error가 low wireless link에서는 high bit-error Error detection noise에 의해 error가 발생함 receiver가 retransmisson이나 frame drop을 보고 error가 발생했음을 알..
2023.05.01
no image
4. Network layer
1. Forwarding vs Routing Forwarding Routing Data plane에서 앞으로 보내기 Control plane에서 길 찾기 Control plane을 바탕으로 Data plane에 routing 정보 기록 Data plane에서는 이것으로 보고 Forwarding 2. Per-Router Control Plane router마다 routing algorithm이 존재 Distributed control 3. Locally Centralized Control Plane local control agent가 remote controller에 정보 제공 Centralized control 4. Data plane 4.1. Forwarding Destination based des..
2023.05.01
no image
3. Transport layer
1. Socket programming 1.1. UDP no connection, out-of-order 전송 1.2. TCP connection 기반, in-order 전송, reliable 1.3. Latency 2. Transport layer vs Network layer Transport layer Network layer logical communication between process logical communication between host 3. Multiplexing vs Demultiplexing Multiplexing Demultiplexing 여러 message를 쪼개서 하나의 segment로 제작 segment를 쪼개서 각 message를 올바른 socket으로 전달 conn..
2023.05.01
no image
2. Application layer
1. Protocol HTTP web protocol, port 80, 주로 TCP(최근 UDP) FTP file 전송, TCP/IP SMTP mail 전송, port 25, TCP, POP3와 IMAP을 사용 POP3 TCP/IP를 이용해 mail을 가져오고 지움 IMAP TCP/IP를 이용해 mail을 가져옴 DNS Domain name-IP 관리, UDP 2. Socket process process마다 전용 socket을 생성하여 data를 주고 받음, port # 할당 IP address로 Host를 찾고 prot #로 process에 data를 전달 3. Application-service requirement Data integrity reliability, loss tolerate Timi..
2023.05.01
no image
1. Overview
​1. Queuing and Delay L: packet length, a: packet arrival rate, R: link bandwidth La/R ~ 0 | delay가 거의 없다 La/R ~ 1 | delay가 크다 La/R > 1 | loss 발생 2. Loss and Delay Total delay = Processing delay + Queuing delay + Transmission delay + Propagation delay 3. Encapsulation between layer 4. Internet history
2023.05.01
no image
[project] QMIX review
1. QMIX 원본 https://arxiv.org/abs/1803.11485 QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning In many real-world settings, a team of agents must coordinate their behaviour while acting in a decentralised way. At the same time, it is often possible to train the agents in a centralised fashion in a simulated or laboratory setting, where global state arxiv.org..
2023.05.01
no image
4. Monte Carlo Methods and Temporal Difference Learning in Policy Evaluation
RECALL * Policy Evaluation MDP와 policy $\pi$가 주어졌을 경우, Bellman equation을 이용해 Policy Evaluation을 할 수 있다. * DP for Policy Evaluation State-value function을 0으로 초기화하고, 수렴할때까지 state-value function을 업데이트했다. $\gamma
2023.05.01
no image
3. Policy Improvement by Iterative Method
1. Policy Iteration 초기 Policy에서 시작해서 Policy Evaluation, Policy Improvement를 반복하여 Optimal policy를 찾는 방법이다. 1.1. Policy Evaluation $Q^{\pi}(s,a) = R(s,a) + \gamma \sum_{s'}p(s'|s,a)V^{\pi}(s')$ $V^{\pi}(s) = \sum_{a} \pi(a|s)Q^{\pi}(s,a)$ State-value function이 $V_{\pi}$수렴할때까지 계속 evaluate하거나, 한번만 하고 Policy Improvement로 넘어가도 된다. 1.2. Policy Improvement (control) $Q^{\pi}(s,a) = R(s,a) + \gamma \sum_..
2023.05.01