Skip to content

SKT-AI-FELLOWSHIP-LIVE-TRACKING/live_tracking

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

68 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

SKT AI fellowship 4๊ธฐ

Installation

pip install -r requirements.txt

Run the code

python web_demo.py

live_tracking_demo

  • Setting Academy Ratio
python web_demo.py --w 1.37 --h 1

์—ฐ๊ตฌ ๋ฐฐ๊ฒฝ

์ตœ๊ทผ ์œ ํŠœ๋ธŒ, ํŠธ์œ„์น˜ ๋“ฑ ๋น„๋””์˜ค ์ŠคํŠธ๋ฆฌ๋ฐ ๋ถ„์•ผ๋Š” ๋ฏธ๋””์–ด ํ™˜๊ฒฝ์—์„œ ๋งค์šฐ ๋น ๋ฅด๊ฒŒ ์„ฑ์žฅํ•˜๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค. ํŠนํžˆ ๋ผ์ด๋ธŒ ์ŠคํŠธ๋ฆฌ๋ฐ ๋ถ„์•ผ ์ƒ์œ„ 5๊ฐœ ์•ฑ์˜ ์ง€๋‚œ 3๋…„ ์—ฐํ‰๊ท  ์„ฑ์žฅ๋ฅ ์ด 25%๋ฅผ ๊ธฐ๋กํ•˜๋ฉด์„œ ์‚ฌ์ง„ ๋ฐ ๋น„๋””์˜ค ์•ฑ์˜ ์—ฐํ‰๊ท  ์„ฑ์žฅ๋ฅ  15%๋ฅผ ๋›ฐ์–ด๋„˜์—ˆ์Šต๋‹ˆ๋‹ค.

์ด์ฒ˜๋Ÿผ ๋ผ์ด๋ธŒ ์ฝ˜ํ…์ธ ๋ฅผ ์ œ์ž‘ํ•  ๋•Œ, ๋‹ค์–‘ํ•œ ์ œ์ž‘ ์†Œ์Šค๋ฅผ ์ž๋™ํ™”ํ•˜์—ฌ ์ œ๊ณตํ•  ์ˆ˜ ์žˆ๋‹ค๋ฉด ์ฝ˜ํ…์ธ  ์ œ์ž‘์ž๋“ค์—๊ฒŒ ๋งค์šฐ ์œ ์šฉํ•œ ์„œ๋น„์Šค๊ฐ€ ๋  ๊ฒƒ์ž…๋‹ˆ๋‹ค.

์ €ํฌ์˜ ์—ฐ๊ตฌ์™€ ๋น„์Šทํ•œ ์ฃผ์ œ๋กœ๋Š” mediapipe์˜ Autoflip[1]๊ณผ apple์˜ Center Stage[2]๊ฐ€ ์žˆ์Šต๋‹ˆ๋‹ค.

  • Autoflip

Autoflip์€ ์ž๋™์œผ๋กœ ๋น„๋””์˜ค๋ฅผ ๋ฆฌํ”„๋ ˆ์ž„ํ•ด์ฃผ๋Š” ์‹œ์Šคํ…œ์ž…๋‹ˆ๋‹ค. ํ”„๋ ˆ์ž„์„ ์›ํ•˜๋Š” ๋น„์œจ๋กœ ๋ฆฌํ”„๋ ˆ์ž„ํ•˜์—ฌ ๋‹ค์–‘ํ•œ ๋ชจ๋ฐ”์ผ ๊ธฐ๊ธฐ๋“ค์—์„œ๋„ ์ฝ˜ํ…์ธ ๋ฅผ ์ด์šฉํ•  ์ˆ˜ ์žˆ๋„๋ก ํ•ฉ๋‹ˆ๋‹ค. ํ•˜์ง€๋งŒ Autoflip์€ ์‹ค์‹œ๊ฐ„ ์˜์ƒ์—์„œ๋Š” ์‚ฌ์šฉํ•˜์ง€ ๋ชปํ•œ๋‹ค๋Š” ์ ์—์„œ ์ €ํฌ ์—ฐ๊ตฌ์™€์˜ ์ฐจ์ด์ ์„ ๊ฐ€์ง‘๋‹ˆ๋‹ค.

  • Center Stage

Center Stage๋Š” ์•„์ดํŒจ๋“œ์—์„œ ์˜์ƒ ํ†ตํ™”๋ฅผ ํ•  ๋•Œ ์‚ฌ๋žŒ์˜ ์–ผ๊ตด์„ ์ธ์‹ํ•˜์—ฌ ์ž๋™์œผ๋กœ ์ถ”์ ํ•˜๋Š” ์‹œ์Šคํ…œ์ž…๋‹ˆ๋‹ค. Center Stage๋Š” ํ•˜๋“œ์›จ์–ด๋กœ ์ถ”์  ์‹œ์Šคํ…œ์ด ์กฐ์ž‘๋˜๋Š” ๋ฐ˜๋ฉด ์ €ํฌ ์—ฐ๊ตฌ๋Š” ์†ŒํฌํŠธ์›จ์–ด๋กœ ์กฐ์ž‘๋˜๋Š” ์ ์ด ํฐ ์ฐจ์ด์ ์ž…๋‹ˆ๋‹ค. ๋˜ํ•œ ์ €ํฌ ์—ฐ๊ตฌ์—์„œ๋Š” ์–ผ๊ตด ๋ฟ๋งŒ ์•„๋‹ˆ๋ผ ์‚ฌ๋žŒ ์ž์ฒด๋ฅผ ์ธ์‹ํ•œ๋‹ค๋Š” ์ ์ด ๋‹ค๋ฅด๋ฉฐ, ์ถ”๊ฐ€์ ์œผ๋กœ ์Šคํฌ์ธ  ๊ฒฝ๊ธฐ์—์„œ ๊ณต์„ ์ถ”์ ํ•˜๋Š” ๊ธฐ๋Šฅ๋„ ๊ตฌํ˜„ ์ค‘์— ์žˆ์Šต๋‹ˆ๋‹ค.

์ดˆ๊ธฐ ์ ‘๊ทผ ๋ฐฉ๋ฒ•

์ €ํฌ๋Š” ๋ผ์ด๋ธŒ ํŠธ๋ž˜ํ‚น ์‹œ์Šคํ…œ์„ (1) trackingํ•  ํ”ผ์‚ฌ์ฒด๋ฅผ ์ธ์‹, (2) ํ”ผ์‚ฌ์ฒด๋ฅผ ์ธ์‹ํ•œ ์ •๋ณด๋ฅผ ๋ฐ”ํƒ•์œผ๋กœ ํ”„๋ ˆ์ž„ ๋ฆฌ์‚ฌ์ด์ง•, (3) ์›น์•ฑ์œผ๋กœ ๋ฆฌํ”„๋ ˆ์ž„ํ•œ ์˜์ƒ ์‹ค์‹œ๊ฐ„์œผ๋กœ ์†ก์ถœํ•˜๋Š” ๊ฒƒ์œผ๋กœ ๊ตฌ์„ฑํ•˜์˜€์Šต๋‹ˆ๋‹ค.

์ด ์ค‘ ์ €๋Š” 1๋ฒˆ๊ณผ 2๋ฒˆ์— ๋Œ€ํ•ด ์—ฐ๊ตฌํ•˜์˜€์Šต๋‹ˆ๋‹ค.

(1)-1 object detection

์šฐ์„  ์›ํ•˜๋Š” ํ”ผ์‚ฌ์ฒด๋ฅผ ์ž๋™์œผ๋กœ ๋”ฐ๋ผ๊ฐ€๋Š” ์‹œ์Šคํ…œ์„ ๋งŒ๋“ค๊ธฐ ์œ„ํ•ด์„œ๋Š” ํ•ด๋‹น ํ”ผ์‚ฌ์ฒด๋ฅผ ์ธ์‹ํ•˜๋Š” ๊ณผ์ •์ด ํ•„์š”ํ•ฉ๋‹ˆ๋‹ค. ๋”ฐ๋ผ์„œ object detection ๊ธฐ์ˆ ์„ ํ™œ์šฉํ•˜์˜€์Šต๋‹ˆ๋‹ค.

object detection์€ ์˜์ƒ์ด๋‚˜ ์ด๋ฏธ์ง€์—์„œ ์‚ฌ๋žŒ, ๋™๋ฌผ ๋“ฑ์˜ ์œ ์˜๋ฏธํ•œ ๊ฐ์ฒด๋ฅผ ์ฐพ๊ธฐ ์œ„ํ•œ ์ปดํ“จํ„ฐ๋น„์ „ ๊ธฐ์ˆ ์ž…๋‹ˆ๋‹ค. object detection์€ ์ด๋ฏธ์ง€ ๋‚ด ํŠน์ • ์‚ฌ๋ฌผ์„ ๋ถ„๋ฅ˜ํ•˜๋Š” task์ธ image classification๊ณผ๋Š” ๋‹ค๋ฅด๊ฒŒ ํƒ์ง€๋œ ๊ฐ์ฒด์˜ ์ข…๋ฅ˜๋ฅผ ์ฐพ๊ณ (classification), ํ•ด๋‹น ๊ฐ์ฒด์˜ ์œ„์น˜(localization)๋ฅผ ์‚ฌ๊ฐํ˜•์˜ ํ˜•ํƒœ์ธ bounding box๋ฅผ ํ™œ์šฉํ•˜์—ฌ ์ฐพ๊ฒŒ ๋ฉ๋‹ˆ๋‹ค.

แ„‰แ…ณแ„แ…ณแ„…แ…ตแ†ซแ„‰แ…ฃแ†บ 2022-10-04 แ„‹แ…ฉแ„’แ…ฎ 9 31 11

object detection์— ์ฃผ๋กœ ์‚ฌ์šฉ๋˜๋Š” ๋ชจ๋ธ์ธ yolo์™€ SSD ์ค‘์—์„œ ๋น„๊ต์  ์†๋„๊ฐ€ ๋น ๋ฅธ SSD ๋ชจ๋ธ์„ ์‚ฌ์šฉํ•˜์—ฌ ๊ฐ์ฒด๋ฅผ ํƒ์ง€ํ•˜๋Š” ๋ฐ์— ํ™œ์šฉํ•˜์˜€์Šต๋‹ˆ๋‹ค.

SSD

SSD[3] ๋ชจ๋ธ์€ ๊ฐ์ฒด ๊ฒ€์ถœ ๋ฐ ๋ถ„๋ฅ˜์™€ bounding box๋ฅผ ๊ตฌํ•˜๋Š” Region Proposal์ด ํ•œ ๋ฒˆ์— ์ด๋ฃจ์–ด์ง€๋Š” one-stage ๋ชจ๋ธ์ž…๋‹ˆ๋‹ค. ๋”ฐ๋ผ์„œ ์ด์ „๊นŒ์ง€์˜ two-stage ๋ชจ๋ธ๊ณผ๋Š” fps, ์ฆ‰ ์—ฐ์‚ฐ ์†๋„๊ฐ€ ๋” ๋น ๋ฅด๋‹ค๋Š” ์žฅ์ ์„ ์ง€๋‹ˆ๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค. ์‹ค์ œ๋กœ ๋Œ€ํ‘œ์ ์ธ two-stage ๋ชจ๋ธ์ธ Faster R-CNN์€ 7 fps์ธ ๋ฐ˜๋ฉด์— SSD ๋ชจ๋ธ์€ 59 fps์˜ ์†๋„๋ฅผ ์ง€๋‹ˆ๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค.

แ„‰แ…ณแ„แ…ณแ„…แ…ตแ†ซแ„‰แ…ฃแ†บ 2022-10-06 แ„‹แ…ฉแ„Œแ…ฅแ†ซ 1 04 09

SSD ๋ชจ๋ธ์€ ์—ฌ๋Ÿฌ ๊ฐœ์˜ feature map์—์„œ object detection์„ ์ˆ˜ํ–‰์‹œํ‚ค๋Š” ์ ์ด ๋‹ค๋ฅธ object detection ๋ชจ๋ธ๋“ค๊ณผ์˜ ์ฐจ์ด์ ์ž…๋‹ˆ๋‹ค. ์—ฌ๋Ÿฌ ๊ฐœ์˜ feature map์€ ๊ฐ๊ฐ ๋‹ค๋ฅธ ์‚ฌ์ด์ฆˆ๋กœ ๊ตฌ์„ฑ๋˜์–ด ์žˆ๋Š”๋ฐ ํฐ ์‚ฌ์ด์ฆˆ(๋†’์€ ํ•ด์ƒ๋„)์˜ feature map์—์„œ๋Š” ํฌ๊ธฐ๊ฐ€ ์ž‘์€ ๊ฐ์ฒด๋ฅผ ์ž˜ ์ธ์‹ํ•˜๊ณ , ์ž‘์€ ์‚ฌ์ด์ฆˆ(๋‚ฎ์€ ํ•ด์ƒ๋„)์˜ feature map์—์„œ๋Š” ํฌ๊ธฐ๊ฐ€ ํฐ ๊ฐ์ฒด๋ฅผ ์ž˜ ์ธ์‹ํ•ฉ๋‹ˆ๋‹ค. ๋‹ค์ˆ˜์˜ feature map์—์„œ ์–ป์€ bounding box ์ •๋ณด๋ฅผ NMS ์ฒ˜๋ฆฌ๋ฅผ ํ•˜๋ฉด์„œ ์ตœ์ข…์ ์œผ๋กœ ๊ฐ์ฒด์˜ ์ข…๋ฅ˜์™€ ๊ฐ์ฒด์˜ ์œ„์น˜๋ฅผ ํŒŒ์•…ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

(1)-2 face detection

แ„‰แ…ณแ„แ…ณแ„…แ…ตแ†ซแ„‰แ…ฃแ†บ 2022-10-06 แ„‹แ…ฉแ„Œแ…ฅแ†ซ 1 24 49

์–ผ๊ตด ์ธ์‹์€ mediapipe์—์„œ ์ œ๊ณตํ•˜๋Š” face detection api๋ฅผ ์‚ฌ์šฉํ•˜์˜€์Šต๋‹ˆ๋‹ค. mediapipe์˜ face detection์€ BlazeFace[4]๋ผ๋Š” ๊ฒฝ๋Ÿ‰ํ™” face detection ๋ชจ๋ธ์„ ํ†ตํ•ด ์ด๋ฃจ์–ด์ง‘๋‹ˆ๋‹ค. ํ•ด๋‹น api๋ฅผ ํ†ตํ•ด ์ด๋ฏธ์ง€ ๋‚ด์—์„œ์˜ ๋‹ค์ค‘ ์–ผ๊ตด์„ ์ฐพ๊ณ , ๊ฐ ์–ผ๊ตด์˜ 6๊ฐœ์˜ ๋žœ๋“œ๋งˆํฌ(์–‘์ชฝ ๋ˆˆ, ์ฝ”, ์ž…, ์–‘์ชฝ ๊ท€)๋“ค์„ ์ฐพ์„ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

(2) reframing

๋‹ค์Œ์œผ๋กœ๋Š” ์•ž์—์„œ ์–ป์€ ํ”ผ์‚ฌ์ฒด์˜ ์œ„์น˜ ์ •๋ณด๋ฅผ ๊ฐ€์ง€๊ณ  ์‚ฌ์šฉ์ž๊ฐ€ ์›ํ•˜๋Š” ๋น„์œจ์— ๋งž์ถฐ์„œ ํ”„๋ ˆ์ž„์„ ์ž˜๋ผ๋‚ด์•ผ ํ•ฉ๋‹ˆ๋‹ค. ๋ฆฌํ”„๋ ˆ์ž„ ๋‹จ๊ณ„์—์„œ๋Š” ํ”ผ์‚ฌ์ฒด์˜ ์ •๋ณด๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ์ž๋ฅผ ํ”„๋ ˆ์ž„์˜ ์ค‘์ ์„ ๊ตฌํ•˜๊ณ , ์ค‘์ ์„ ๊ธฐ์ค€์œผ๋กœ ์ž„์˜์˜ ๋น„์œจ์— ๋งž์ถฐ์„œ ์ž˜๋ผ๋‚ด๋„๋ก ํ•˜์˜€์Šต๋‹ˆ๋‹ค.

(2)-1 ๋ฆฌํ”„๋ ˆ์ž„ ์ตœ์ ์ธ ์ค‘์  ๊ตฌํ•˜๊ธฐ

ํ”ผ์‚ฌ์ฒด๊ฐ€ ํ•˜๋‚˜์ธ ๊ฒฝ์šฐ์—๋Š” ์ธ์‹๋œ ํ”ผ์‚ฌ์ฒด์˜ ์ค‘์ ์ด ์ž๋ฅผ ํ”„๋ ˆ์ž„์˜ ์ค‘์ ์ด ๋˜๊ธฐ ๋•Œ๋ฌธ์— ๋ฆฌํ”„๋ ˆ์ž„ํ•  ์ค‘์ ์„ ๊ตฌํ•˜๋Š” ๊ฒƒ์ด ๋‹จ์ˆœํ•ฉ๋‹ˆ๋‹ค. ํ•˜์ง€๋งŒ ๋ฌธ์ œ๋Š” ํ”ผ์‚ฌ์ฒด๊ฐ€ ์—ฌ๋Ÿฌ ๊ฐœ์ผ ๊ฒฝ์šฐ์— ๋ฐœ์ƒํ•˜์˜€์Šต๋‹ˆ๋‹ค. ํ”ผ์‚ฌ์ฒด๊ฐ€ ์—ฌ๋Ÿฌ ๊ฐœ์ผ ๊ฒฝ์šฐ ์ƒํ™ฉ์— ๋”ฐ๋ผ์„œ๋Š” ์ตœ๋Œ€ํ•œ ๋งŽ์€ ํ”ผ์‚ฌ์ฒด๋ฅผ ์ž๋ฅผ ํ”„๋ ˆ์ž„์— ๋‹ด์•„๋‚ด๊ฑฐ๋‚˜ ํ˜น์€ ๊ณผ๊ฐํžˆ ๋ช‡๋ช‡ ํ”ผ์‚ฌ์ฒด๋Š” ๋ฒ„๋ ค์•ผ ํ–ˆ์Šต๋‹ˆ๋‹ค. ๋”ฐ๋ผ์„œ ๊ฐ€์žฅ ์‹ ๋ขฐํ•  ์ˆ˜ ์žˆ๋Š” ํ”ผ์‚ฌ์ฒด(mAP score๊ฐ€ ๊ฐ€์žฅ ํฐ ํ”ผ์‚ฌ์ฒด)๋ฅผ ์ค‘์‹ฌ์œผ๋กœ ๋ฆฌํ”„๋ ˆ์ž„์„ ํ•˜์˜€์Šต๋‹ˆ๋‹ค. ๋˜ํ•œ ํ•ด๋‹น ํ”ผ์‚ฌ์ฒด ๊ทผ์ฒ˜์— ์žˆ๋Š” ๋‹ค๋ฅธ ํ”ผ์‚ฌ์ฒด๋“ค์ด ์กด์žฌํ•˜๊ณ , ํ”ผ์‚ฌ์ฒด ์‚ฌ์ด์˜ ๊ฑฐ๋ฆฌ๊ฐ€ ๋ฉ€์ง€ ์•Š๋‹ค๊ณ  ํŒ๋‹จ๋  ๋•Œ ์ด๋“ค์„ ๋ฆฌํ”„๋ ˆ์ž„ ์‹œ ๊ฐ™์ด ๋‹ด์•„๋‚ด๋ ค๊ณ  ํ•˜์˜€์Šต๋‹ˆ๋‹ค. ์ตœ์  ์ค‘์  ๊ตฌํ•˜๊ธฐ ๊ด€๋ จ ์•Œ๊ณ ๋ฆฌ์ฆ˜ ์„ค๋ช…์€ ์•„๋ž˜์—์„œ ๋” ์ž์„ธํžˆ ๋‹ค๋ฃจ๊ฒ ์Šต๋‹ˆ๋‹ค.

Face / Object detection์œผ๋กœ ์ธ์‹ํ•œ ์ •๋ณด ์ค‘์—์„œ mAP ์Šค์ฝ”์–ด๊ฐ€ ๊ฐ€์žฅ ํฐ ํ”ผ์‚ฌ์ฒด๋ฅผ list์— ์ €์žฅํ•ฉ๋‹ˆ๋‹ค. ์ดํ›„ ์ž„์˜์˜ ๊ฒฝ๊ณ„ ๋ฒ”์œ„(target width / height์˜ ์ ˆ๋ฐ˜์œผ๋กœ ์„ค์ •) ์•ˆ์— ๋‹ค๋ฅธ detection์˜ ์ค‘์‹ฌ ์ขŒํ‘œ๊ฐ€ ๋“ค์–ด์˜ฌ ์ˆ˜ ์žˆ๋Š” ๊ฒฝ์šฐ์— ํ•ด๋‹น ์ •๋ณด๋ฅผ list์— ์ €์žฅํ•˜๊ณ  ๊ฒฝ๊ณ„ ๋ฒ”์œ„๋ฅผ updateํ•ฉ๋‹ˆ๋‹ค. ์•ž์˜ ๊ณผ์ •์„ ๋ชจ๋‘ ๋งˆ์นœ ํ›„ list์— ์ €์žฅ๋œ detection ์ค‘์‹ฌ ์ขŒํ‘œ๋“ค์˜ ํ‰๊ท ์„ ๊ตฌํ•˜๋ฉด ๋ฆฌํ”„๋ ˆ์ž„์— ์ตœ์ ์ธ ์ค‘์ ์„ ์–ป๊ฒŒ ๋ฉ๋‹ˆ๋‹ค.

แ„‰แ…ณแ„แ…ณแ„…แ…ตแ†ซแ„‰แ…ฃแ†บ 2022-10-04 แ„‹แ…ฉแ„’แ…ฎ 11 25 16

์œ„์˜ ์‚ฌ์ง„์œผ๋กœ ์˜ˆ์‹œ๋ฅผ ๋“ค๊ฒ ์Šต๋‹ˆ๋‹ค. ๊ฐ ํ”ผ์‚ฌ์ฒด์˜ id๋Š” mAP score๊ฐ€ ๋†’์€ ์ˆœ์„œ๋Œ€๋กœ ๋ถ€์—ฌํ•˜์˜€๊ณ , ์™ผ์ชฝ์— ํŒŒ๋ž€ ๋ง‰๋Œ€๋Š” ์ž„์˜์˜ ๊ฒฝ๊ณ„ ๋ฒ”์œ„์ž…๋‹ˆ๋‹ค. 1๋ฒˆ์„ ๊ธฐ์ค€์œผ๋กœ ๊ฒฝ๊ณ„ ๋ฒ”์œ„ ์•ˆ์— 2๋ฒˆ, 3๋ฒˆ, 4๋ฒˆ ํ”ผ์‚ฌ์ฒด๊ฐ€ ๋“ค์–ด์˜ฌ ์ˆ˜ ์žˆ๋Š”์ง€ ์ฐจ๋ก€๋Œ€๋กœ ๋น„๊ตํ•ฉ๋‹ˆ๋‹ค. ๋งŒ์•ฝ์— ์œ„์˜ ์ƒํ™ฉ์—์„œ 2๋ฒˆ ํ”ผ์‚ฌ์ฒด์™€ 1๋ฒˆ ํ”ผ์‚ฌ์ฒด์™€์˜ ๊ฑฐ๋ฆฌ๊ฐ€ ์„ค์ •ํ•œ ๊ฒฝ๊ณ„ ๋ฒ”์œ„ ๋ณด๋‹ค ์ž‘๋‹ค๊ณ  ๊ฐ€์ •ํ•˜๋ฉด 2๋ฒˆ ํ”ผ์‚ฌ์ฒด๋„ ์ž๋ฅผ ํ”„๋ ˆ์ž„์— ๋‹ด์•„๋‚ด๋„๋ก ํ•˜๊ณ (2๋ฒˆ ํ”ผ์‚ฌ์ฒด์˜ ์ค‘์  ์ขŒํ‘œ๋ฅผ list์— ์ €์žฅ) ๊ฒฝ๊ณ„ ๋ฒ”์œ„๋ฅผ 2๋ฒˆ ํ”ผ์‚ฌ์ฒด์™€ 1๋ฒˆ ํ”ผ์‚ฌ์ฒด์˜ ๊ฑฐ๋ฆฌ๋งŒํผ ๋นผ์ฃผ๋ฉด์„œ updateํ•˜๋Š” ๊ฒƒ์ž…๋‹ˆ๋‹ค. 3๋ฒˆ, 4๋ฒˆ ํ”ผ์‚ฌ์ฒด๋Š” 1, 2๋ฒˆ ํ”ผ์‚ฌ์ฒด์™€์˜ ๊ฑฐ๋ฆฌ๊ฐ€ update๋œ ๊ฒฝ๊ณ„ ๋ฒ”์œ„ ๋ณด๋‹ค ํฌ๋‹ค๊ณ  ๊ฐ€์ •ํ•˜๊ฒ ์Šต๋‹ˆ๋‹ค.

แ„‰แ…ณแ„แ…ณแ„…แ…ตแ†ซแ„‰แ…ฃแ†บ 2022-10-04 แ„‹แ…ฉแ„’แ…ฎ 11 32 31

์œ„์˜ ์ƒํ™ฉ๋Œ€๋กœ ์•Œ๊ณ ๋ฆฌ์ฆ˜ ์ฒ˜๋ฆฌ๋ฅผ ๋งˆ์น˜๋ฉด list์— ์ €์žฅ๋œ ์ค‘์  ์ขŒํ‘œ๋“ค์˜ ํ‰๊ท ์ด ๋ฆฌํ”„๋ ˆ์ž„ ์ตœ์  ์ค‘์ ์ด ๋ฉ๋‹ˆ๋‹ค. ํ•ด๋‹น ์ตœ์  ์ค‘์ ์„ ํ†ตํ•ด ๋ฆฌํ”„๋ ˆ์ž„ํ•˜๋ฉด ์‚ฌ์ง„์—์„œ ๋ณด์ด๋Š” ๋นจ๊ฐ„์ƒ‰ ๋ฐ•์Šค์™€ ๊ฐ™์€ ํ”„๋ ˆ์ž„์„ ์–ป์„ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

(2)-2 ํ”„๋ ˆ์ž„ ์ž˜๋ผ๋‚ด๊ธฐ

๋ฆฌํ”„๋ ˆ์ž„์„ ํ•  ๋•Œ๋Š” ํ”ผ์‚ฌ์ฒด๋ฅผ zoomingํ•˜์ง€ ์•Š๊ณ  ํŠน์ • ๋น„์œจ์—์„œ ๊ธฐ์กด ํ”„๋ ˆ์ž„์ด ์ตœ๋Œ€ํ•œ ๋‹ด๊ธธ ์ˆ˜ ์žˆ๋„๋ก ์„ค์ •ํ•˜์—ฌ ํ”„๋ ˆ์ž„์ด ๊นจ์ง€๋Š” ์ƒํ™ฉ์ด ๋ฐœ์ƒํ•˜์ง€ ์•Š๋„๋ก ํ•˜์˜€์Šต๋‹ˆ๋‹ค.

์˜ˆ๋ฅผ ๋“ค์–ด ์›๋ณธ ํ”„๋ ˆ์ž„์˜ ์‚ฌ์ด์ฆˆ๊ฐ€ 1024x720์ด๊ณ  5:4๋กœ ์ž˜๋ผ์•ผ ํ•˜๋Š” ๊ฒฝ์šฐ, ์›๋ณธ ํ”„๋ ˆ์ž„์˜ ๋น„์œจ(64/45=1.4222โ€ฆ)์ด ๋ฆฌํ”„๋ ˆ์ž„ํ•˜๋ ค๋Š” ๋น„์œจ(5/4=1.25)๋ณด๋‹ค ํฌ๊ธฐ ๋•Œ๋ฌธ์— ์›๋ณธ ํ”„๋ ˆ์ž„์˜ ๋„“์ด ๋ถ€๋ถ„์„ ์ž˜๋ผ๋‚ด์–ด 5:4์˜ ๋น„์œจ์„ ๋งž์ถ”๋Š” ๋ฐฉ์‹์ž…๋‹ˆ๋‹ค.

(2)-3 ํ”„๋ ˆ์ž„ ๋ณด๊ฐ„

์ธ์‹๋œ ํ”ผ์‚ฌ์ฒด๊ฐ€ ์›€์ง์ด๊ฑฐ๋‚˜, ํ”ผ์‚ฌ์ฒด์˜ ๊ฐœ์ˆ˜๊ฐ€ ๋ฐ”๋€Œ๋Š” ๋“ฑ์˜ ๊ฒฝ์šฐ์—์„œ๋Š” ์ฒ˜๋ฆฌํ•˜๊ณ ์ž ํ•˜๋Š” ์ค‘์ ์˜ ์ขŒํ‘œ ๊ฐ’์ด ์ด์ „ ํ”„๋ ˆ์ž„์— ๋น„ํ•ด ํฌ๊ฒŒ ๋‹ฌ๋ผ์ง‘๋‹ˆ๋‹ค. ์ด๋•Œ ํŠน๋ณ„ํ•œ ์ฒ˜๋ฆฌ ์—†์ด ๊ทธ๋Œ€๋กœ ์ž˜๋ผ๋‚ธ ํ”„๋ ˆ์ž„๋“ค์„ ์†ก์ถœํ•  ๊ฒฝ์šฐ ์‚ฌ์šฉ์ž์˜ ์ž…์žฅ์—์„œ ํ™”๋ฉด์ด ๋งค์šฐ ํ”๋“ค๋ฆฌ๋Š” ๊ฒƒ์œผ๋กœ ๋ณด์ด๊ฒŒ ๋ฉ๋‹ˆ๋‹ค. ๋”ฐ๋ผ์„œ ํ”„๋ ˆ์ž„ ๋ณด๊ฐ„๋ฒ•์„ ๊ณ ์•ˆํ•˜์˜€์Šต๋‹ˆ๋‹ค.

๋ณด๊ฐ„์ด๋ž€ ๊ธฐ์กด์— ์•Œ๋˜ ์–ด๋– ํ•œ ๋‘ ์ง€์  ์‚ฌ์ด์— ์œ„์น˜ํ•œ ์ƒˆ๋กœ์šด ๋ฐ์ดํ„ฐ์˜ ๊ฐ’์„ ์ถ”์ •ํ•˜๋Š” ๊ฒƒ์„ ์˜๋ฏธํ•ฉ๋‹ˆ๋‹ค.

แ„‰แ…ณแ„แ…ณแ„…แ…ตแ†ซแ„‰แ…ฃแ†บ 2022-10-04 แ„‹แ…ฉแ„’แ…ฎ 11 51 00

์œ„์˜ ์‚ฌ์ง„์€ ์„ ํ˜• ๋ณด๊ฐ„๋ฒ•์˜ ์˜ˆ์‹œ๋กœ, ๋์ ์ธ (x0, y0), (x1, y1)์˜ ๊ฐ’์ด ์ฃผ์–ด์กŒ์„ ๋•Œ ์ง์„  ๊ฑฐ๋ฆฌ์— ๋”ฐ๋ผ ์„ ํ˜•์ ์œผ๋กœ ๊ณ„์‚ฐํ•˜์—ฌ ๊ทธ ์‚ฌ์ด์— ์œ„์น˜ํ•œ ๊ฐ’ (x, y)๋ฅผ ์ถ”์ •ํ•˜๋Š” ๊ฒƒ์ž…๋‹ˆ๋‹ค.

ํ”„๋ ˆ์ž„ ๋ณด๊ฐ„๋ฒ•์€ euclidean norm์œผ๋กœ ํ•œ ํ”„๋ ˆ์ž„์˜ ์ขŒํ‘œ์™€ ๋‹ค๋ฅธ ํ”„๋ ˆ์ž„์˜ ์ขŒํ‘œ์˜ ์ง์„  ๊ฑฐ๋ฆฌ linearํ•˜๊ฒŒ ๋‚˜ํƒ€๋‚ธ ํ›„์— ํ•œ ํ”„๋ ˆ์ž„์˜ timestamp์™€ ๋‹ค๋ฅธ ํ”„๋ ˆ์ž„์˜ timestamp ์‚ฌ์ด์— ์กด์žฌํ•˜๋Š” timestamp์— ํ•ด๋‹นํ•˜๋Š” ์ขŒํ‘œ ๊ฐ’์„ ์ถ”์ •ํ•˜์—ฌ ์ด๋ฃจ์–ด์ง‘๋‹ˆ๋‹ค. ์ฆ‰ ํ•œ ํ”„๋ ˆ์ž„์—์„œ ๋‹ค์Œ ํ”„๋ ˆ์ž„์ด ์†ก์ถœ๋˜๊ธฐ ์ „์— ๋ณด๊ฐ„ํ•œ ์ขŒํ‘œ ๊ฐ’์„ ์ค‘์‹ฌ์œผ๋กœ cropํ•œ ํ”„๋ ˆ์ž„๋“ค์„ ์ฑ„์›Œ ๋„ฃ์–ด์„œ ๋” ์ž์—ฐ์Šค๋Ÿฌ์šด ์˜์ƒ์ด ๋ณด์—ฌ์ง€๊ฒŒ ํ•˜๋Š” ๊ฒƒ์ž…๋‹ˆ๋‹ค.

แ„‰แ…ณแ„แ…ณแ„…แ…ตแ†ซแ„‰แ…ฃแ†บ 2022-10-05 แ„‹แ…ฉแ„Œแ…ฅแ†ซ 12 03 32

ํ•˜์ง€๋งŒ ๊ธฐ์กด์˜ ๋ณด๊ฐ„๋ฒ•์„ ๋ผ์ด๋ธŒ ํŠธ๋ž˜ํ‚น์— ์‚ฌ์šฉํ•˜๊ธฐ์—๋Š” ๋ฌธ์ œ๊ฐ€ ๋ฐœ์ƒํ•˜์˜€์Šต๋‹ˆ๋‹ค.

euclidean norm ๋ณด๊ฐ„ ๋ฐฉ๋ฒ•์„ ์‚ฌ์šฉํ•  ๋•Œ Key frame๋“ค์˜ ์ขŒํ‘œ๋“ค๊ณผ timestamp๋ฅผ ํ™œ์šฉํ•˜์—ฌ ๋ณด๊ฐ„ํ•˜๋Š”๋ฐ, ์‹ค์‹œ๊ฐ„์—์„  ์˜์ƒ ๋ฐ์ดํ„ฐ์™€ ๋‹ฌ๋ฆฌ ๋‹ค์Œ์œผ๋กœ ์–ป์–ด์˜ค๋Š” ํ”„๋ ˆ์ž„๋“ค์„ ์•Œ ์ˆ˜ ์—†๊ธฐ ๋•Œ๋ฌธ์— ํ•ด๋‹น ๋ฐฉ๋ฒ•์„ ์‚ฌ์šฉํ•  ์ˆ˜ ์—†์—ˆ์Šต๋‹ˆ๋‹ค.

๋”ฐ๋ผ์„œ ์ด์ „ ํ”„๋ ˆ์ž„๊ณผ ํ˜„์žฌ ํ”„๋ ˆ์ž„์˜ ์ค‘์‹ฌ ์ขŒํ‘œ๋ฅผ ๋น„๊ตํ•˜๋Š” online ๋ฐฉ์‹์œผ๋กœ ํ”„๋ ˆ์ž„์„ ๋ณด๊ฐ„ํ•˜์˜€์Šต๋‹ˆ๋‹ค. ๋˜ํ•œ timestamp๊ฐ€ ์—†๋Š” ๋ฌธ์ œ๋Š” ์ด์ „ ํ”„๋ ˆ์ž„๊ณผ ํ˜„์žฌ ํ”„๋ ˆ์ž„ ์‚ฌ์ด์— ์ฑ„์›Œ ๋„ฃ๋Š” ๋ณด๊ฐ„ ํ”„๋ ˆ์ž„์˜ ๊ฐœ์ˆ˜๋ฅผ ์ž„์˜๋กœ ์ •ํ•˜์—ฌ ํ•ด๊ฒฐํ•˜์˜€์Šต๋‹ˆ๋‹ค.

UPDATE !! (in 2024.08)

๋ณด๊ฐ„ ํ”„๋ ˆ์ž„์„ ์ด์ „ ํ”„๋ ˆ์ž„๊ณผ ํ˜„์žฌ ํ”„๋ ˆ์ž„ ์‚ฌ์ด์— ๋ผ์›Œ ๋„ฃ์–ด์„œ ๋ณด๊ฐ„ํ•˜๋Š” ๊ฒƒ์€ Web Application์—์„œ ์ž‘๋™ํ•˜๋Š” ๋ฐ ๋ฌธ์ œ๋ฅผ ๋ฐœ์ƒ์‹œํ‚ต๋‹ˆ๋‹ค.. ํ•œ ๊ฐœ ์ด์ƒ์˜ ํ”„๋ ˆ์ž„์„ ์ง€์†์ ์œผ๋กœ ์‚ฝ์ž…ํ•˜๊ฒŒ ๋˜๋ฉด, Web์—์„œ ํ”„๋ ˆ์ž„์„ ์ฒ˜๋ฆฌํ•˜๋Š” ์—ฐ์‚ฐ ์‹œ๊ฐ„์„ ์ดˆ๊ณผํ•˜๊ฒŒ ๋ฉ๋‹ˆ๋‹ค. ์ด์— ๋”ฐ๋ผ ํ•„์š”ํ•œ ํ”„๋ ˆ์ž„๋“ค์„ ์†ก์ถœํ•˜๋Š” ๊ฒƒ์ด ๋ถˆ๊ฐ€๋Šฅํ•ด์ง€๋ฉฐ ๋ช‡๋ช‡ ํ”„๋ ˆ์ž„๋“ค์ด drop๋˜๋Š” ํ˜„์ƒ์ด ๋ฐœ์ƒํ•ฉ๋‹ˆ๋‹ค.

๋”ฐ๋ผ์„œ interpolation ๋ฐฉ๋ฒ•์„ ์ด์ „ frame step์—์„œ์˜ key coord์™€ ํ˜„์žฌ frame step (detection + tracking์œผ๋กœ ๊ตฌํ•œ) key coord ์‚ฌ์ด์— linear interpolation์„ ์ ์šฉํ•˜์—ฌ ํ˜„์žฌ frame step์—์„œ์˜ key coord๋ฅผ updateํ•˜๋Š” ๊ฒƒ์œผ๋กœ ๋ณ€๊ฒฝํ•˜์˜€์Šต๋‹ˆ๋‹ค.

$$ optimal center = (1 - \alpha) * prev center + \alpha * current center $$

  • $\alpha$ : fps์— ๋”ฐ๋ผ ์กฐ์ •๋˜๋Š” parameter
  • $\alpha$ = clamp(min=0.01, max=1.0, fps)

-> ์ด์ „ ๋ฐฉ๋ฒ•๋ณด๋‹ค framing ์•ˆ์ •์„ฑ์ด ๋งค์šฐ ๊ฐœ์„ ๋˜์—ˆ์Šต๋‹ˆ๋‹ค!!

๋ฌธ์ œ ์‹๋ณ„

  • ํ”ผ์‚ฌ์ฒด์˜ ์œ„์น˜ ์ •๋ณด ๋ณ€ํ™”์— ๋”ฐ๋ฅธ ํ”„๋ ˆ์ž„ ํ”๋“ค๋ฆผ

ํ”ผ์‚ฌ์ฒด์˜ ์ˆ˜๊ฐ€ 1๋ช…์—์„œ 2๋ช…์ด ๋  ๋•Œ, ํ˜น์€ ์ฒ˜์Œ๋ถ€ํ„ฐ ์—ฌ๋Ÿฌ ๋ช…์ด ์žˆ์„ ๋•Œ ํ”„๋ ˆ์ž„์ด ๋งค์šฐ ํ”๋“ค๋ฆฌ๋Š” ํ˜„์ƒ์ด ๋ฐœ์ƒํ•˜์˜€์Šต๋‹ˆ๋‹ค.

์ธ์‹๋œ ํ”ผ์‚ฌ์ฒด์˜ ์ˆ˜๊ฐ€ 2 ์ด์ƒ์ผ ๋•Œ ์ž„์˜์˜ ๋ฒ”์œ„๋ฅผ ๋‘์–ด ์ฒ˜๋ฆฌํ•˜๋Š” ์•Œ๊ณ ๋ฆฌ์ฆ˜์—์„œ ์„ค์ •ํ•œ ๋ฒ”์œ„์— ์–ด๋–ค ํ”ผ์‚ฌ์ฒด๊ฐ€ ๊ฑธ์ณ ์žˆ์„ ๊ฒฝ์šฐ ํ”ผ์‚ฌ์ฒด๋ฅผ ์žก์•˜๋‹ค๊ฐ€ ์•ˆ ์žก๋Š” ์ƒํ™ฉ์ด ๋ฐ˜๋ณต๋˜๋ฉด์„œ ํ”„๋ ˆ์ž„์ด ํ”๋“ค๋ฆฌ๊ฒŒ ๋˜๋Š” ๊ฒƒ์ด์—ˆ์Šต๋‹ˆ๋‹ค.

  • object detection ๋ชจ๋ธ์˜ ์ •ํ™•๋„๊ฐ€ ๋‚ฎ๋‹ค.

แ„‰แ…ณแ„แ…ณแ„…แ…ตแ†ซแ„‰แ…ฃแ†บ 2022-10-08 แ„‹แ…ฉแ„’แ…ฎ 9 55 18

fps๋ฅผ 30์ด์ƒ์œผ๋กœ ๊ตฌํ˜„ํ•˜๊ธฐ ์œ„ํ•ด์„œ ์†๋„๊ฐ€ ๋น ๋ฅธ object detection ๋ชจ๋ธ์ธ SSD๋ฅผ ์‚ฌ์šฉํ•˜๋‹ค ๋ณด๋‹ˆ, ๋ชจ๋ธ์˜ ์„ฑ๋Šฅ์ด ์ข‹์ง€ ์•Š์•˜์Šต๋‹ˆ๋‹ค. ๋”ฐ๋ผ์„œ ๊ฐ์ฒด๋ฅผ ์ธ์‹ํ•˜์ง€ ๋ชปํ•˜๊ฑฐ๋‚˜, ๊ฐ์ฒด์˜ ์œ„์น˜์— ๋”ฐ๋ฅธ bounding box๋ฅผ ์ œ๋Œ€๋กœ ๊ทธ๋ฆฌ์ง€ ๋ชปํ•˜๋Š” ํ˜„์ƒ์ด ๋ฐœ์ƒํ•˜์˜€์Šต๋‹ˆ๋‹ค.

์ ‘๊ทผ ๋ฐฉ๋ฒ• / ํ•ด๊ฒฐ

์ด์ „๊นŒ์ง€๋Š” ๋งค ํ”„๋ ˆ์ž„๋งˆ๋‹ค object detection์„ ํ•˜์—ฌ ํ”ผ์‚ฌ์ฒด์˜ ์œ„์น˜ ์ •๋ณด๋ฅผ ํŒŒ์•…ํ•˜์˜€์Šต๋‹ˆ๋‹ค. ํ•˜์ง€๋งŒ ์œ„์˜ ๋ฐฉ์‹์ด ํšจ์œจ์ ์ด์ง€ ์•Š๋‹ค๊ณ  ์ƒ๊ฐํ•˜์˜€์Šต๋‹ˆ๋‹ค.

๋˜ํ•œ fps 30์ด์ƒ์œผ๋กœ ๋งŒ๋“ค์–ด์•ผ ํ•˜๊ธฐ ๋•Œ๋ฌธ์— ๋น„๊ต์  ์†๋„๊ฐ€ ๋น ๋ฅธ object detection ๋ชจ๋ธ์„ ์‚ฌ์šฉํ•˜์˜€๋˜ ๊ฒƒ์ธ๋ฐ ๊ผญ ๋ชจ๋ธ์˜ ์†๋„๋ฅผ ๋นจ๋ฆฌํ•˜์—ฌ fps 30์ด์ƒ์„ ๊ตฌํ˜„ํ•  ํ•„์š”๊ฐ€ ์—†๋‹ค๋Š” ๊ฒƒ์„ ์•Œ๊ฒŒ ๋˜์—ˆ์Šต๋‹ˆ๋‹ค.

๊ธฐ์กด ๋ฐฉ์‹์„ ๋ฐ”๊พธ์–ด ํ•œ ํ”„๋ ˆ์ž„์—์„œ object detection์„ ํ•˜๋ฉด ์ดํ›„์˜ ํ”„๋ ˆ์ž„์—์„œ๋Š” ๋‹ค๋ฅธ ๋ฐฉ์‹์œผ๋กœ ํ”ผ์‚ฌ์ฒด์˜ ์œ„์น˜ ์ •๋ณด๋ฅผ ํŒŒ์•…ํ•˜๋Š” ๊ฒƒ์„ ๊ณ ๋ คํ•˜์˜€๊ณ  ์ด๋•Œ object tracking์„ ์‚ฌ์šฉํ•˜๊ธฐ๋กœ ํ•˜์˜€์Šต๋‹ˆ๋‹ค.

multiple object tracking(MOT)

multiple object tracking์€ ๋‹ค์ˆ˜์˜ ๊ฐ์ฒด๋ฅผ ์ถ”์ ํ•˜๋Š” ๊ฒƒ์„ ์˜๋ฏธํ•ฉ๋‹ˆ๋‹ค. object tracking์€ ์‹œ์ž‘ ํ”„๋ ˆ์ž„์˜ bounding box ์ขŒํ‘œ๋งŒ์œผ๋กœ trackingํ•˜๋Š” Detection-Free-Tracking๊ณผ object detector๋กœ ์–ป๋Š” bounding box ์ขŒํ‘œ๋กœ trackingํ•˜๋Š” Detection-Based-Tracking์œผ๋กœ ๋‚˜๋‰˜๋Š”๋ฐ, ๋Œ€๋ถ€๋ถ„ Detection-Based-Tracking ๋ฐฉ์‹์œผ๋กœ ์‚ฌ์šฉ๋ฉ๋‹ˆ๋‹ค.

๋˜ํ•œ ์ „์ฒด ํ”„๋ ˆ์ž„์˜ detection ์ •๋ณด๋ฅผ ํ™œ์šฉํ•˜์—ฌ tracking trajactory๋ฅผ ๋งŒ๋“œ๋Š” batch tracking ๋ฐฉ์‹๊ณผ ์ด์ „ ํ”„๋ ˆ์ž„๊ณผ ํ˜„์žฌ ํ”„๋ ˆ์ž„์˜ detection ์ •๋ณด๋ฅผ ํ™œ์šฉํ•˜์—ฌ trackingํ•˜๋Š” online tracking ๋ฐฉ์‹์ด ์กด์žฌํ•˜๋Š”๋ฐ, ์‹ค์‹œ๊ฐ„์œผ๋กœ ๊ตฌํ˜„ํ•ด์•ผ ํ•˜๋Š” ์—ฐ๊ตฌ ๋ฐฉํ–ฅ์— ๋งž๋„๋ก online tracking ๋ฐฉ์‹์„ ์ฑ„ํƒํ•˜์˜€์Šต๋‹ˆ๋‹ค.

object tracking์—์„œ ์ฃผ๋กœ ์‚ฌ์šฉ๋˜๋Š” ๊ธฐ์ˆ ์€ SORT(simple online real-time tracker)์ž…๋‹ˆ๋‹ค.

SORT

SORT[5]๋Š” kalman filter์™€ ํ—๊ฐ€๋ฆฌ์•ˆ ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ์ด์šฉํ•˜์—ฌ ํ”ผ์‚ฌ์ฒด๋ฅผ ์ถ”์ ํ•˜๋Š” ์•Œ๊ณ ๋ฆฌ์ฆ˜์ž…๋‹ˆ๋‹ค.

  • Kalman Filter

์นผ๋งŒ ํ•„ํ„ฐ๋Š” ์˜์ƒ ๋‚ด ๊ฐ์ฒด์˜ ์›€์ง์ž„์ด ์„ ํ˜•์ ์ด๋ผ๊ณ  ๊ฐ€์ •ํ•˜๊ณ (์˜์ƒ ๋‚ด์—์„œ ๊ฐ์ฒด๊ฐ€ ๊ฐ‘์ž๊ธฐ ์‚ฌ๋ผ์ง€๋Š” ๊ฒฝ์šฐ๊ฐ€ ๋งค์šฐ ์ ๊ธฐ ๋•Œ๋ฌธ), ์ด์ „ ๊ฐ์ฒด์˜ ์œ„์น˜ ๋ฐ ์†๋„๋ฅผ ๊ณ„์‚ฐํ•˜์—ฌ ํ˜„์žฌ ํ”„๋ ˆ์ž„์˜ ๊ฐ์ฒด ์œ„์น˜๋ฅผ ํ™•๋ฅ ์ ์œผ๋กœ ์ถ”์ •ํ•ฉ๋‹ˆ๋‹ค.

แ„‰แ…ณแ„แ…ณแ„…แ…ตแ†ซแ„‰แ…ฃแ†บ 2022-10-06 แ„‹แ…ฉแ„Œแ…ฅแ†ซ 11 14 42

  • uย : ์ด์ „ ํ”„๋ ˆ์ž„์—์„œ์˜ ๊ฐ์ฒด ๊ฐ€๋กœ ์ค‘์  ์œ„์น˜
  • vย : ์ด์ „ ํ”„๋ ˆ์ž„์—์„œ์˜ ๊ฐ์ฒด ์„ธ๋กœ ์ค‘์  ์œ„์น˜
  • sย : ๊ฐ์ฒด์˜ bounding box์˜ scale
  • rย : ๊ฐ์ฒด์˜ bounding box์˜ ๋น„์œจ (๊ฐ€๋กœ / ์„ธ๋กœ)

u, v, s, r ์˜ ๊ฐ’์„ ์„ ํ˜• ๋“ฑ์† ๋ชจ๋ธ์„ ํ†ตํ•ด ํ˜„์žฌ ํ”„๋ ˆ์ž„์˜ ๊ฐ์ฒด์˜ ์ƒํƒœ(u', v' , s')๋ฅผ ์ถ”์ ํ•ฉ๋‹ˆ๋‹ค. ์ด๋•Œ bounding box์˜ ๋น„์œจ์€ ์ผ์ •ํ•˜๋‹ค๊ณ  ๊ฐ€์ •ํ•ฉ๋‹ˆ๋‹ค.

  • ํ—๊ฐ€๋ฆฌ์•ˆ ์•Œ๊ณ ๋ฆฌ์ฆ˜
  1. ์นผ๋งŒ ํ•„ํ„ฐ๋ฅผ ํ†ตํ•ด ์ด์ „ ํ”„๋ ˆ์ž„๊นŒ์ง€์˜ ๊ฐ์ฒด ์œ„์น˜๋ฅผ ์˜ˆ์ธกํ•ฉ๋‹ˆ๋‹ค.

  2. ํ˜„์žฌ ํ”„๋ ˆ์ž„์˜ ๊ฐ์ฒด ์œ„์น˜ ์ •๋ณด๋ฅผ detector๋กœ ์•Œ์•„๋ƒ…๋‹ˆ๋‹ค.

  3. ํ—๊ฐ€๋ฆฌ์•ˆ ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ํ†ตํ•ด ์นผ๋งŒ ํ•„ํ„ฐ๋กœ ์˜ˆ์ธกํ•œ ๊ฐ’๊ณผ detector๋กœ ์ธ์‹ํ•œ ๊ฐ’๋“ค์„ ๊ฑฐ๋ฆฌ๊ฐ€ ๊ฐ€๊นŒ์šด ๊ฒƒ๋ผ๋ฆฌ ๋งค์นญํ•ฉ๋‹ˆ๋‹ค.

์œ„์˜ ๊ณผ์ •์„ ํ†ตํ•ด ํ˜„์žฌ ์ด๋ฏธ์ง€ ๋‚ด์—์„œ์˜ ๊ฐ์ฒด ์ •๋ณด๋ฅผ ํŒŒ์•…ํ•˜๊ณ  ์ถ”์ ํ•˜๊ฒŒ ๋ฉ๋‹ˆ๋‹ค.

แ„‰แ…ณแ„แ…ณแ„…แ…ตแ†ซแ„‰แ…ฃแ†บ 2022-10-27 แ„‹แ…ฉแ„’แ…ฎ 4 14 52

Deep SORT[6]๋Š” ๊ธฐ์กด SORT ๋ฐฉ์‹์—์„œ Matching cascade์™€ Person Re-Identification์„ ์ถ”๊ฐ€ํ•œ ๊ฒƒ์ž…๋‹ˆ๋‹ค.

(Matching cascade์™€ Person Re-Identification์€ ๋ถ€๋ก์—์„œ ์„ค๋ช…)

FastMOT ์‚ฌ์šฉ ์ด์œ  ๋ฐ ์›๋ฆฌ

SORT์™€ DeepSORT ๋“ฑ์˜ two-stage tracker๋Š” ๋งค ํ”„๋ ˆ์ž„๋งˆ๋‹ค object detector์˜ detection ์ธ์‹ ์ •๋ณด๋ฅผ ํ•„์š”๋กœ ํ•ฉ๋‹ˆ๋‹ค. ํ•˜์ง€๋งŒ object detector๋ฅผ ๋” ์„ฑ๋Šฅ์ด ์ข‹์€ ๊ฒƒ์œผ๋กœ ๋ฐ”๊พธ๊ฒŒ ๋œ๋‹ค๋ฉด, ์†๋„๊ฐ€ ๋” ๋А๋ฆฐ ๋ชจ๋ธ์„ ์‚ฌ์šฉํ•ด์•ผ ํ•˜๋ฏ€๋กœ ๋งค ํ”„๋ ˆ์ž„๋งˆ๋‹ค object detection์„ ํ•  ๊ฒฝ์šฐ์— fps๊ฐ€ 30์ด์ƒ์ธ ์ƒํƒœ๋ฅผ ์œ ์ง€ํ•˜๊ธฐ ์–ด๋ ต๊ฒŒ ๋ฉ๋‹ˆ๋‹ค.

๋”ฐ๋ผ์„œ ๋งค ํ”„๋ ˆ์ž„๋งˆ๋‹ค object detection์„ ํ•˜์ง€ ์•Š์•„๋„ object tracking ํ•˜๋Š” ๊ฒƒ์ด ๊ฐ€๋Šฅํ•œ FastMOT๋ฅผ ์‚ฌ์šฉํ•˜์˜€์Šต๋‹ˆ๋‹ค.

FastMOT๋Š” N๊ฐœ์˜ ํ”„๋ ˆ์ž„๋งˆ๋‹ค object detection์„ ํ•˜๊ณ , ๊ทธ ๊ณต๋ฐฑ์„ ์นผ๋งŒ ํ•„ํ„ฐ๋ฅผ ํ†ตํ•ด ํ”ผ์‚ฌ์ฒด์˜ ์œ„์น˜๋ฅผ ์˜ˆ์ธกํ•˜๋Š” ๋ฐฉ์‹์œผ๋กœ ์ž‘๋™ํ•ฉ๋‹ˆ๋‹ค.

์นผ๋งŒ ํ•„ํ„ฐ์˜ ์˜ˆ์ธก ์ •ํ™•๋„๋ฅผ ๋” ๋†’์ด๊ธฐ ์œ„ํ•ด ์ด์ „ ํ”„๋ ˆ์ž„๊ณผ ํ˜„์žฌ ํ”„๋ ˆ์ž„์˜ optical flow๋ฅผ ๊ณ„์‚ฐํ•œ ํ›„์— ์นผ๋งŒ ํ•„ํ„ฐ๋ฅผ ์ ์šฉํ•˜๋Š” ๋ฐฉ์‹์„ ์‚ฌ์šฉํ•˜์—ฌ object detector ์—†์ด๋„ tracking ํ”„๋ ˆ์ž„์—์„œ ํ”ผ์‚ฌ์ฒด๋ฅผ ์ถ”์ ํ•  ์ˆ˜ ์žˆ๊ฒŒ ๋ฉ๋‹ˆ๋‹ค.

แ„‰แ…ณแ„แ…ณแ„…แ…ตแ†ซแ„‰แ…ฃแ†บ 2022-08-19 แ„‹แ…ฉแ„’แ…ฎ 7 29 45

Sparse frame processing

์ฒซ ๋ฒˆ์งธ ํ”„๋ ˆ์ž„์—์„œ object detector๋ฅผ ํ†ตํ•ด ํ”ผ์‚ฌ์ฒด์˜ ์œ„์น˜ ์ •๋ณด๋ฅผ ์ธ์‹ํ•œ ํ›„ object tracker์— ํ•ด๋‹น ์ •๋ณด๋ฅผ ์ „๋‹ฌํ•˜๊ณ , ํ”ผ์‚ฌ์ฒด์˜ ์ •๋ณด๋ฅผ ๋ฐ”ํƒ•์œผ๋กœ ์ž˜๋ผ๋‚ธ ํ”„๋ ˆ์ž„์„ ์†ก์ถœํ•ฉ๋‹ˆ๋‹ค. ์ดํ›„ ๋‘ ๋ฒˆ์งธ ํ”„๋ ˆ์ž„๋ถ€ํ„ฐ๋Š” ์ด์ „ detection ์ •๋ณด์™€ ์ด์ „ ์ด๋ฏธ์ง€์™€ ํ˜„์žฌ ์ด๋ฏธ์ง€์˜ image feature ๋น„๊ต ์ •๋ณด๋ฅผ ๊ฐ€์ง€๊ณ  ํ˜„์žฌ ํ”ผ์‚ฌ์ฒด์˜ ์œ„์น˜ ์ •๋ณด๋ฅผ ์˜ˆ์ธกํ•ฉ๋‹ˆ๋‹ค. tracker๋กœ ์ถ”์ •๋œ ์œ„์น˜ ์ •๋ณด๋ฅผ ๋ฐ”ํƒ•์œผ๋กœ ๋ฆฌํ”„๋ ˆ์ž„ํ•˜์—ฌ ํ”„๋ ˆ์ž„๋“ค์„ ์†ก์ถœํ•˜๊ณ , ํ•ด๋‹น ๊ณผ์ •์„ 4๋ฒˆ ๊ฑฐ์นœ ํ›„์— ๋‹ค์‹œ object detection ํ”„๋ ˆ์ž„์œผ๋กœ ๋Œ์•„์˜ค๊ฒŒ ๋ฉ๋‹ˆ๋‹ค. ์ฆ‰ detection ํ”„๋ ˆ์ž„๊ณผ tracking ํ”„๋ ˆ์ž„์˜ ๋น„์œจ์„ 1:4๋กœ ๊ตฌ์„ฑํ•˜์˜€์Šต๋‹ˆ๋‹ค. ์ค‘๊ฐ„์— tracker๊ฐ€ ์ธ์‹ ๋๋˜ ํ”ผ์‚ฌ์ฒด๋ฅผ ๋†“์น˜๋Š” ๊ฒฝ์šฐ์—๋Š” ๋‹ค์Œ ํ”„๋ ˆ์ž„์—์„œ object detection์„ ํ•˜๋„๋ก ํ•˜์˜€์Šต๋‹ˆ๋‹ค.

๊ฒฐ๊ณผ

์œ„์˜ ๋ฐฉ์‹์„ ์‚ฌ์šฉํ•˜์˜€์„ ๋•Œ ํ”ผ์‚ฌ์ฒด๊ฐ€ ์—ฌ๋Ÿฌ ๊ฐœ์ธ ๊ฒฝ์šฐ์—๋„ ์ด์ „๋ณด๋‹ค ํ”„๋ ˆ์ž„์ด ํ”๋“ค๋ฆฌ๋Š” ํ˜„์ƒ์ด ๋งค์šฐ ์ ์–ด์กŒ์Šต๋‹ˆ๋‹ค.

๋งค ํ”„๋ ˆ์ž„๋งˆ๋‹ค detection์œผ๋กœ ํ”ผ์‚ฌ์ฒด์˜ ์œ„์น˜ ์ •๋ณด๋ฅผ ํŒŒ์•…ํ•  ๋•Œ์™€ ๋‹ฌ๋ฆฌ object tracker๋ฅผ ํ†ตํ•ด ์ธ์‹๋œ ํ”ผ์‚ฌ์ฒด๋ฅผ ์ง€์†์ ์œผ๋กœ ์ถ”์ ํ•˜๊ณ , ๊ทธ ๊ณผ์ •์—์„œ ๊ฐ ํ”„๋ ˆ์ž„์˜ ํŠน์ง•์„ ์ถ”์ถœํ•˜์—ฌ ์‚ฌ์šฉํ•˜๋ฉด์„œ ์˜์ƒ ๋‚ด์˜ ํ๋ฆ„์„ ๋ฐ˜์˜ํ•  ์ˆ˜ ์žˆ๊ฒŒ ๋˜์—ˆ๊ธฐ ๋•Œ๋ฌธ์ž…๋‹ˆ๋‹ค.

๋˜ํ•œ object tracker๋กœ ๊ฐ์ฒด๋ฅผ ์ถ”์ ํ•  ๋•Œ ์ฒ˜์Œ ์ธ์‹๋˜์—ˆ๋˜ ํ”ผ์‚ฌ์ฒด๋ฅผ ์œ„์ฃผ๋กœ tracking์ด ์ด๋ฃจ์–ด์ง€๋ฉด์„œ ์ž์—ฐ์Šค๋Ÿฝ๊ฒŒ ํ”„๋ ˆ์ž„์ด ํ”๋“ค๋ฆฌ๋Š” ํ˜„์ƒ์ด ์ค„์–ด๋“ค์—ˆ์Šต๋‹ˆ๋‹ค.

๊ฒฐ๋ก (์š”์•ฝ ๋ฐ ํ–ฅํ›„ ์—ฐ๊ตฌ ๊ณผ์ œ)

์š”์•ฝ

FastMOT object tracker๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ์ง€์†์ ์ด๊ณ  ์žฅ๊ธฐ์ ์œผ๋กœ ๊ฐ์ฒด๋ฅผ ์ถ”์ ํ•  ์ˆ˜ ์žˆ๊ฒŒ ํ•˜์˜€์Šต๋‹ˆ๋‹ค. object detection๋งŒ์œผ๋กœ ํ”ผ์‚ฌ์ฒด๋ฅผ ์ถ”์ ํ•˜๋Š” ์ดˆ๊ธฐ ์ ‘๊ทผ ๋ฐฉ์‹์—์„œ object tracking ๋ฐฉ์‹์œผ๋กœ ํ”ผ์‚ฌ์ฒด๋ฅผ ์ถ”์ ํ•˜๋Š” ๊ฒƒ์œผ๋กœ ๋ฐ”๊พธ์—ˆ๋˜ ๊ฒƒ์ด ์—ฐ๊ตฌ์—์„œ ํฐ ์„ฑ๊ณผ๋ฅผ ๋ณด์˜€์Šต๋‹ˆ๋‹ค.

ํ–ฅํ›„ ์—ฐ๊ตฌ ๊ณผ์ œ

Visual Transformer ๊ตฌ์กฐ์—์„œ ํŒŒ์ƒ๋œ object detection ๋ชจ๋ธ์ธ DINO, SwinV2-G ๋“ฑ SOTA ํ˜น์€ ๊ทธ์— ์ค€ํ•˜๋Š” ๋ชจ๋ธ์„ ์‚ฌ์šฉํ•œ๋‹ค๋ฉด ์—ฐ๊ตฌ๊ฐ€ ๋”์šฑ ๋ฐœ์ „๋  ๊ฒƒ์ด๋ผ๊ณ  ์ƒ๊ฐ๋ฉ๋‹ˆ๋‹ค. Object detection ๋ชจ๋ธ์˜ ์„ฑ๋Šฅ์ด ์ข‹์•„์งˆ์ˆ˜๋ก Object tracker์˜ ์„ฑ๋Šฅ๋„ ์ข‹์•„์ง€๊ธฐ ๋•Œ๋ฌธ์— ๋ชจ๋ธ์˜ ๊ต์ฒด๊ฐ€ ๋” ํฐ ํšจ๊ณผ๋ฅผ ๋ฐœํœ˜ํ•  ๊ฒƒ์ด๋ผ๊ณ  ์˜ˆ์ƒํ•ฉ๋‹ˆ๋‹ค. ๋‹ค๋งŒ Object detection ๋ชจ๋ธ์„ ๊ต์ฒดํ•  ๊ฒฝ์šฐ, ํ”„๋ ˆ์ž„์„ ์ˆœ์ฐจ์ ์œผ๋กœ ์ฒ˜๋ฆฌํ•˜๋Š” ๊ธฐ์กด ์•Œ๊ณ ๋ฆฌ์ฆ˜๋Œ€๋กœ ์‚ฌ์šฉํ•œ๋‹ค๋ฉด detection ํ”„๋ ˆ์ž„์—์„œ๋Š” ์†ก์ถœ์ด ๋งค์šฐ ๋А๋ ค์ง€๋Š” ํ˜„์ƒ์ด ๋ฐœ์ƒํ•  ๊ฒƒ์ž…๋‹ˆ๋‹ค. ๋”ฐ๋ผ์„œ ๋ฉ€ํ‹ฐํ”„๋กœ์„ธ์‹ฑ์„ ์ ‘๋ชฉํ•œ๋‹ค๋ฉด ํ•ด๋‹น ๋ฌธ์ œ๊ฐ€ ํ•ด๊ฒฐ๋˜์ง€ ์•Š์„๊นŒ ์ƒ๊ฐํ•ฉ๋‹ˆ๋‹ค.

Appendix

  • fps๋ž€?

fps(frame per second)๋Š” 1์ดˆ๋‹น ๋ณด์—ฌ์ง€๋Š” ํ”„๋ ˆ์ž„์˜ ๊ฐœ์ˆ˜๋ฅผ ์˜๋ฏธํ•ฉ๋‹ˆ๋‹ค. ์‚ฌ๋žŒ์˜ ๋ˆˆ์— ์˜์ƒ์ด ์‹ค์‹œ๊ฐ„์œผ๋กœ ์›€์ง์ด๋Š” ๊ฒƒ์ฒ˜๋Ÿผ ๋ณด์ด๋ ค๋ฉด fps๊ฐ€ 30 ์ด์ƒ์ด์–ด์•ผ ํ•ฉ๋‹ˆ๋‹ค.

  • bounding box

แ„‰แ…ณแ„แ…ณแ„…แ…ตแ†ซแ„‰แ…ฃแ†บ 2022-10-06 แ„‹แ…ฉแ„Œแ…ฅแ†ซ 11 35 35

Object detection์—์„œ ๊ฐ์ฒด์˜ ์œ„์น˜๋ฅผ ํ‘œํ˜„ํ•  ๋•Œ ์‚ฌ์šฉํ•˜๋Š” ์ง์‚ฌ๊ฐํ˜• ๋ชจ์–‘์˜ ๋ฐ•์Šค์ž…๋‹ˆ๋‹ค.

x์ถ•๊ณผ y์ถ•์„ ํ†ตํ•ด ํ‘œํ˜„ํ•˜๋ฉฐ, (x1, y1)์€ x๊ฐ’๊ณผ y๊ฐ’์˜ ์ตœ์†Ÿ๊ฐ’์„ ๋‚˜ํƒ€๋‚ด๊ณ  (x2, y2)์€ x,y ๊ฐ’์˜ ์ตœ๋Œ“๊ฐ’์„ ๋‚˜ํƒ€๋ƒ…๋‹ˆ๋‹ค.

  • mAP score

mAP(mean Average Precision)๋Š” Object detector์˜ ์ •ํ™•๋„๋ฅผ ํ‰๊ฐ€ํ•  ๋•Œ ์ฃผ๋กœ ์‚ฌ์šฉ๋˜๋Š” ํ‰๊ฐ€์ง€ํ‘œ์ž…๋‹ˆ๋‹ค. mAP์— ๋Œ€ํ•ด ์•Œ๊ธฐ ์ „์— IOU, precision, recall๊ฐœ๋…์— ๋Œ€ํ•ด ์•Œ์•„์•ผ ํ•ฉ๋‹ˆ๋‹ค.

Object detection์—์„œ ๊ฐ์ฒด์˜ ์œ„์น˜๋ฅผ ๋งž๊ฒŒ ์ถ”์ •ํ–ˆ๋Š”์ง€ ํŒ๋‹จํ•  ๋•Œ IOU๋ผ๋Š” ๊ฐœ๋…์„ ์‚ฌ์šฉํ•ฉ๋‹ˆ๋‹ค.

แ„‰แ…ณแ„แ…ณแ„…แ…ตแ†ซแ„‰แ…ฃแ†บ 2022-10-08 แ„‹แ…ฉแ„’แ…ฎ 10 30 00

IOU(Intersection over Union)๋Š” ๊ต์ง‘ํ•ฉ์˜ ์˜์—ญ / ํ•ฉ์ง‘ํ•ฉ์˜ ์˜์—ญ์ž…๋‹ˆ๋‹ค.

Object detector๊ฐ€ ์˜ˆ์ธกํ•œ bounding box์™€ ์‹ค์ œ ๊ฐ์ฒด์˜ bounding box(ground-truth) ์˜์—ญ ๊ฐ„์˜ IOU ๊ฐ’์ด 0.5 ์ด์ƒ์ผ ๋•Œ Object detector๊ฐ€ ์˜ณ๊ฒŒ ์˜ˆ์ธกํ–ˆ๋‹ค๊ณ  ํŒ๋‹จํ•˜๊ฒŒ ๋ฉ๋‹ˆ๋‹ค.

แ„‰แ…ณแ„แ…ณแ„…แ…ตแ†ซแ„‰แ…ฃแ†บ 2022-10-08 แ„‹แ…ฉแ„’แ…ฎ 10 16 47

  • $precision = { TP \over TP + FP }$

precision(์ •๋ฐ€๋„)์€ ๋ชจ๋ธ์ด True๋ผ๊ณ  ์˜ˆ์ธกํ•œ ๊ฒƒ๋“ค ์ค‘์—์„œ ์‹ค์ œ๋กœ True์ธ ๊ฒƒ์˜ ๋น„์œจ์ž…๋‹ˆ๋‹ค.

  • $recall = { TP \over TP + FN }$

recall(์žฌํ˜„๋ฅ )์€ ์‹ค์ œ๋กœ True์ธ ๊ฒƒ๋“ค ์ค‘์—์„œ ๋ชจ๋ธ์ด True๋ผ๊ณ  ํ•œ ๊ฒƒ์˜ ๋น„์œจ์ž…๋‹ˆ๋‹ค.

๋‘˜ ์ค‘ ํ•œ ๊ฐ’๋งŒ์œผ๋กœ๋Š” ์ข‹์€ ๋ชจ๋ธ์„ ํŒ๋‹จํ•˜๊ธฐ ์–ด๋ ต๊ธฐ ๋•Œ๋ฌธ์— precision๊ณผ recall์„ ๋™์‹œ์— ์‚ฌ์šฉํ•˜์—ฌ ๋ชจ๋ธ์„ ํ‰๊ฐ€ํ•˜๊ฒŒ ๋ฉ๋‹ˆ๋‹ค. precision๊ณผ recall ๊ฐ’์œผ๋กœ ๊ทธ๋ฆฐ ๊ทธ๋ž˜ํ”„์˜ ์•„๋ž˜ ๋ฉด์ ์„ AP(Average Precision)๋กœ ์ •์˜ํ•˜๊ณ  ๊ฐ ํด๋ž˜์Šค์—์„œ ๊ตฌํ•œ AP์˜ ํ‰๊ท ์ธ mAP๋ฅผ ํ†ตํ•ด Object detector์˜ ์„ฑ๋Šฅ์„ ํ‰๊ฐ€ํ•ฉ๋‹ˆ๋‹ค.

  • DeepSORT

    • Matching cascade

    ์ตœ๊ทผ์— ์ƒ์„ฑ๋œ track์— ์šฐ์„  ์ˆœ์œ„๋ฅผ ๋ถ€์—ฌํ•˜๋Š” ์•Œ๊ณ ๋ฆฌ์ฆ˜์ž…๋‹ˆ๋‹ค. ๊ฐ€์žฅ ์ตœ๊ทผ์— ์ƒ์„ฑ๋œ track์ผ์ˆ˜๋ก ๋” ์ •ํ™•ํ•œ ์ถ”์ ์ด๊ณ , ๋‚˜์ค‘์— ์ƒ์„ฑ๋œ track์ผ์ˆ˜๋ก ๋ถˆํ™•์‹คํ•œ ์ถ”์ ์ด๋ผ๊ณ  ํŒ๋‹จํ•˜๊ธฐ ๋•Œ๋ฌธ์ž…๋‹ˆ๋‹ค.

แ„‰แ…ณแ„แ…ณแ„…แ…ตแ†ซแ„‰แ…ฃแ†บ 2022-10-09 แ„‹แ…ฉแ„’แ…ฎ 3 18 12

  • Matching Cascade๊ฐ€ ์ผ์–ด๋‚˜๋Š” ๊ณผ์ •์€ ๋‹ค์Œ๊ณผ ๊ฐ™์Šต๋‹ˆ๋‹ค.

    • Track๊ณผ Detection ๊ฐ„์˜ cost matrix๋ฅผ ๊ตฌํ•ฉ๋‹ˆ๋‹ค. cost matrix๋Š” Mahalanobis distance์™€ cosine distance๋ฅผ ํ†ตํ•ด ๊ตฌํ•˜๋Š”๋ฐ, ๋…ผ๋ฌธ์—์„œ๋Š” cosine distance๋งŒ์„ ์‚ฌ์šฉํ–ˆ์„ ๋•Œ ์˜คํžˆ๋ ค ์„ฑ๋Šฅ์ด ์ข‹๋‹ค๊ณ  ํ•ฉ๋‹ˆ๋‹ค.

    • track์— ๋Œ€ํ•œ ์ถ”์ • ํƒ€๋‹น์„ฑ์„ ๊ฒ€์ฆํ•˜๋Š” gate matrix๋ฅผ ๊ตฌํ•ฉ๋‹ˆ๋‹ค. Track๊ณผ Detection ์‚ฌ์ด์˜ cost์— ์ž„๊ณ„๊ฐ’์„ ์„ค์ •ํ•˜๋Š” ๋ฐฉ๋ฒ•์„ ํ†ตํ•ด ์žˆ์„ ๊ฒƒ ๊ฐ™์ง€ ์•Š๋‹ค๊ณ  ํŒ๋‹จ๋˜๋Š” track์„ ์ œ๊ฑฐํ•ฉ๋‹ˆ๋‹ค.

    • matched detections๋ฅผ ๊ณต์ง‘ํ•ฉ์œผ๋กœ, unmatched detections๋ฅผ Detections๋กœ ์ดˆ๊ธฐํ™”ํ•ฉ๋‹ˆ๋‹ค.

    • ๊ฐ age๋ฅผ ๊ฐ€์ง€๋Š” track๋“ค์„ ์ˆœ์ฐจ์ ์œผ๋กœ ํƒ์ƒ‰ํ•˜๋ฉด์„œ, linear assignment๋ฅผ ์ง„ํ–‰ํ•ฉ๋‹ˆ๋‹ค. ์ด๋•Œ matching๋˜์ง€ ์•Š์€ track๋“ค์€ ์ œ๊ฑฐํ•ฉ๋‹ˆ๋‹ค.

    • Re-identification

แ„‰แ…ณแ„แ…ณแ„…แ…ตแ†ซแ„‰แ…ฃแ†บ 2022-10-06 แ„‹แ…ฉแ„’แ…ฎ 3 31 27

แ„‰แ…ณแ„แ…ณแ„…แ…ตแ†ซแ„‰แ…ฃแ†บ 2022-10-06 แ„‹แ…ฉแ„’แ…ฎ 3 31 52

SORT๋Š” ID switching์˜ ๋ฌธ์ œ๋ฅผ ๊ฐ–๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค. ID switching์€ ๋‹ค์–‘ํ•œ ๊ฐ์ฒด๋ฅผ ์ถ”์ ํ•  ๋•Œ, ๊ฐ ๊ฐœ์ฒด์˜ track ID๊ฐ€ ๋ฐ”๋€Œ๋Š” ํ˜„์ƒ์ž…๋‹ˆ๋‹ค. ID switching์€ tracking์˜ ์„ฑ๋Šฅ์„ ์ €ํ•˜์‹œํ‚ค๋Š” ์š”์ธ์ด๊ธฐ ๋•Œ๋ฌธ์— ์ด๋ฅผ ํ•ด๊ฒฐํ•˜๊ณ ์ž Re-identification ๋ชจ๋ธ์„ ์ ์šฉํ•˜์˜€์Šต๋‹ˆ๋‹ค. 

Person Re-identification์€ ํŠน์ • ์‚ฌ๋žŒ์„ ๋‹ค์–‘ํ•œ ๊ฐ๋„๋‚˜ ์œ„์น˜์— ์žˆ๋Š” ๋‹ค๋ฅธ ์ด๋ฏธ์ง€๋“ค์—์„œ ์ฐพ๋Š” task์ž…๋‹ˆ๋‹ค. CNN์˜ feature space ์ƒ์—์„œ ๋™์ผํ•œ ์‚ฌ๋žŒ์— ๋Œ€ํ•œ feature๋Š” feature ์‚ฌ์ด์˜ ๊ฑฐ๋ฆฌ๊ฐ€ ๊ฐ€๊น๊ฒŒ mappingํ•˜๊ณ , ๋‹ค๋ฅธ ์‚ฌ๋žŒ์— ๋Œ€ํ•œ feature๋Š” feature ์‚ฌ๋ฆฌ์˜ ๊ฑฐ๋ฆฌ๊ฐ€ ๋ฉ€๊ฒŒ mappingํ•˜๋Š” ๋ฐฉ์‹์„ ํ†ตํ•ด ์‚ฌ๋žŒ์˜ ํŠน์ง•์„ ์ž˜ ํŒŒ์•…ํ•˜๋Š” ๋ชจ๋ธ์„ ์–ป์„ ์ˆ˜ ์žˆ๊ณ , ์ด ๋ชจ๋ธ์ด OSNet์ž…๋‹ˆ๋‹ค. 

DeepSORT์—์„œ๋Š” OSNet์„ ํ™œ์šฉํ•˜์—ฌ detection๋œ ํ”ผ์‚ฌ์ฒด๋ฅผ ์ž˜๋ผ๋‚ธ ํ”„๋ ˆ์ž„์—์„œ image feature๋ฅผ ์ถ”์ถœํ•ฉ๋‹ˆ๋‹ค. ํ•ด๋‹น ์ •๋ณด๋ฅผ tracking์— ์‚ฌ์šฉํ•˜์—ฌ ID switching ๋ฌธ์ œ๋ฅผ 45% ๊ฐ์†Œ์‹œ์ผฐ์Šต๋‹ˆ๋‹ค.

Reference

[1] Google Research, "AutoFlip: An Open Source Framework for Intelligent Video Reframing", https://ai.googleblog.com/2020/02/autoflip-open-source-framework-for.html (accessed Oct. 06, 2022)

[2] Apple (2021), "Capture high-quality photos using video formats", https://developer.apple.com/videos/play/wwdc2021/10047/?time=808

[3] Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang Fu, Alexander C. Berg (2016), "SSD: Single Shot MultiBox Detector", https://arxiv.org/pdf/1512.02325.pdf

[4] Google Research (2019), "BlazeFace: Sub-millisecond Neural Face Detection on Mobile GPUs", https://arxiv.org/pdf/1907.05047.pdf

[5] Alex Bewley, Zongyuan Ge, Lionel Ott, Fabio Ramos, Ben Upcroft (2016), "Simple Online and Realtime Tracking", https://arxiv.org/pdf/1602.00763.pdf

[6] Nicolai Wojke, Alex Bewley, Dietrich Paulus (2017), "Simple Online and Realtime Tracking with a Deep Association Metric", https://arxiv.org/pdf/1703.07402.pdf

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors