Ivo Georgiev
Created August 7, 2022

Real-time multi-camera 3D audio-visual scene segmentation

Multiple Spresence devices with camera, mic, and LTE observe the same scene and comprehend it in 3D in multiple sensor modalities.

21
Real-time multi-camera 3D audio-visual scene segmentation

Things used in this project

Hardware components

Spresense boards (main & extension)
Sony Spresense boards (main & extension)
Part of field capture device.
×2
Spresense camera board
Sony Spresense camera board
Part of field capture device (video).
×2
Spresense LTE extension board
Sony Spresense LTE extension board
Streaming from field capture device.
×2
TDK Corporation Omnidirectional environmental microphone
Part of field capture device (audio).
×4
LTE SIM Card (SORACOM)
×2

Software apps and online services

Sony Spresense SDK
TensorFlow
TensorFlow
OBS Studio by OBS Project
KiCad
KiCad

Story

Read more

Schematics

System overview

Three stages: (1) capture, edge-AI segmentation, and tagging, (2) streaming, (3) real-time whole-scene stitching and integration.

Code

Project repo

Credits

Ivo Georgiev

Ivo Georgiev

7 projects • 5 followers

Comments