Index
- Cache the results to hide the network delivery and server processing latency
- Only send frames that are significantly different from previous frames
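A minimal sketch of how the two ideas above can combine on the client: keep the last server result cached and upload only frames that changed enough to invalidate it. The pixel-difference test, `DIFF_THRESHOLD`, and `FrameFilterClient` are illustrative assumptions, not taken from any particular paper.

```python
import numpy as np

# Mean absolute per-pixel difference above which a frame is considered "new enough" to upload.
DIFF_THRESHOLD = 10.0  # tuning is workload-specific

def mean_abs_diff(frame_a, frame_b):
    return float(np.mean(np.abs(frame_a.astype(np.float32) - frame_b.astype(np.float32))))

class FrameFilterClient:
    def __init__(self, send_to_server):
        self.send_to_server = send_to_server  # callable: frame -> analysis result
        self.last_sent_frame = None
        self.cached_result = None

    def process(self, frame):
        # First frame, or a frame that changed a lot: upload it and refresh the cache.
        if self.last_sent_frame is None or mean_abs_diff(frame, self.last_sent_frame) > DIFF_THRESHOLD:
            self.cached_result = self.send_to_server(frame)
            self.last_sent_frame = frame
        # Otherwise hide network/server latency by reusing the cached result.
        return self.cached_result

# Toy usage with a fake "server" that counts bright pixels.
fake_server = lambda f: int((f > 128).sum())
client = FrameFilterClient(fake_server)
frames = [np.zeros((4, 4), np.uint8), np.zeros((4, 4), np.uint8), np.full((4, 4), 255, np.uint8)]
print([client.process(f) for f in frames])  # the second frame reuses the cached result
```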
- Starfish: Efficient Concurrency Support for Computer Vision Applications - LiKamWa et al., MobiSys '15
- Track identical library calls and reuse computed results across multiple applications
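A minimal sketch of the call-deduplication idea above, assuming calls are keyed by function name plus a hash of their arguments; the shared cache, `dedup_call`, and the toy "library" function are illustrative stand-ins rather than Starfish's actual mechanism.

```python
import hashlib

shared_cache = {}  # (function name, argument digest) -> cached result, shared by all apps

def call_key(func_name, frame_bytes, params):
    digest = hashlib.sha1(frame_bytes + repr(sorted(params.items())).encode()).hexdigest()
    return (func_name, digest)

def dedup_call(func, func_name, frame_bytes, **params):
    key = call_key(func_name, frame_bytes, params)
    if key not in shared_cache:            # first app to issue this call computes the result
        shared_cache[key] = func(frame_bytes, **params)
    return shared_cache[key]               # identical calls from other apps reuse it

# Toy "library" call issued by two apps on the same frame.
extract_edges = lambda frame_bytes, threshold: f"edges(threshold={threshold})"
frame = b"\x00" * 100
a = dedup_call(extract_edges, "extract_edges", frame, threshold=30)  # computed
b = dedup_call(extract_edges, "extract_edges", frame, threshold=30)  # served from the cache
print(a == b, len(shared_cache))  # -> True 1
```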
- Group cameras monitoring the same area into clusters
- Uploads frames with high “utility” (e.g., object count)
- Uploads frames that are different from previous frames (e.g., different object counts)
- MCDNN: An Approximation-Based Execution Framework for Deep Stream Processing Under Resource Constraints - Han et al., MobiSys '16
- Adaptively pick the best specialized model
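A minimal sketch of "pick the best specialized model" as a constrained selection problem: choose the highest-accuracy variant from a catalog that fits the current memory and latency budgets. The catalog entries and budget numbers are made up for illustration, not taken from MCDNN.

```python
from dataclasses import dataclass

@dataclass
class ModelVariant:
    name: str
    accuracy: float    # expected accuracy of this (possibly specialized) variant
    memory_mb: float   # resident memory cost
    latency_ms: float  # per-frame latency

CATALOG = [
    ModelVariant("full",        0.92, 900.0, 120.0),
    ModelVariant("compressed",  0.88, 300.0,  60.0),
    ModelVariant("specialized", 0.90, 150.0,  30.0),  # specialized to the current context
]

def pick_variant(catalog, memory_budget_mb, latency_budget_ms):
    """Return the highest-accuracy variant that fits both budgets, or None."""
    feasible = [m for m in catalog
                if m.memory_mb <= memory_budget_mb and m.latency_ms <= latency_budget_ms]
    return max(feasible, key=lambda m: m.accuracy, default=None)

print(pick_variant(CATALOG, memory_budget_mb=400, latency_budget_ms=100))  # -> "specialized"
```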
- DeepEye: Resource Efficient Local Execution of Multiple Deep Vision Models using Wearable Commodity Hardware - Mathur et al., MobiSys '17
- A system that can run multiple cloud-scale DL models locally on wearable devices
- Interleave the loading of memory-intensive FC layers and the execution of compute-intensive convolution layers
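A minimal sketch of the interleaving idea above, assuming a background thread loads a model's memory-heavy FC weights while its compute-heavy convolution layers execute; all layer work is simulated with sleeps and the timings are invented.

```python
import time
from concurrent.futures import ThreadPoolExecutor

MODELS = ["face", "scene", "object"]

def load_fc_weights(model):
    time.sleep(0.3)   # memory-bound: read large FC weights from storage
    return f"{model}-fc-weights"

def run_conv_layers(model, frame):
    time.sleep(0.3)   # compute-bound: convolution layers
    return f"{model}-conv-features"

def run_fc_layers(features, fc_weights):
    time.sleep(0.05)
    return f"result({features}, {fc_weights})"

def run_all(frame):
    results = []
    with ThreadPoolExecutor(max_workers=1) as loader:
        for model in MODELS:
            fc_future = loader.submit(load_fc_weights, model)  # overlaps with conv execution
            features = run_conv_layers(model, frame)
            results.append(run_fc_layers(features, fc_future.result()))
    return results

start = time.time()
run_all(frame="frame-0")
print(f"{time.time() - start:.2f}s")  # roughly 3 x 0.35s instead of 3 x 0.65s
```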
- Leverage short-term class skew using a model cascade
- Train specialized models online on the target video
- Model cascade: difference detector (MSE) → cheap/specialized model → full model
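A minimal sketch of the three-stage cascade above (difference detector using MSE, then a cheap/specialized model, then the full model). The thresholds and the two model callables are illustrative assumptions.

```python
import numpy as np

MSE_THRESHOLD = 50.0                        # below this, the frame is "unchanged": reuse the last label
CONFIDENCE_LOW, CONFIDENCE_HIGH = 0.2, 0.8  # outside this band the cheap model is trusted

def mse(a, b):
    return float(np.mean((a.astype(np.float32) - b.astype(np.float32)) ** 2))

def cascade(frames, cheap_model, full_model):
    """cheap_model / full_model: frame -> probability that the target class is present."""
    last_frame, last_label = None, None
    labels = []
    for frame in frames:
        if last_frame is not None and mse(frame, last_frame) < MSE_THRESHOLD:
            labels.append(last_label)                         # stage 1: difference detector
            continue
        p = cheap_model(frame)                                # stage 2: cheap/specialized model
        if p <= CONFIDENCE_LOW or p >= CONFIDENCE_HIGH:
            last_label = p >= CONFIDENCE_HIGH
        else:
            last_label = full_model(frame) >= 0.5             # stage 3: fall back to the full model
        last_frame = frame
        labels.append(last_label)
    return labels

frames = [np.zeros((8, 8)), np.zeros((8, 8)), np.full((8, 8), 255.0)]
print(cascade(frames,
              cheap_model=lambda f: 0.95 if f.mean() > 100 else 0.05,
              full_model=lambda f: float(f.mean() > 100)))    # -> [False, False, True]
```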
- Objective: support efficient real-time analytics for multiple queries with different quality and lag goals
- Offline Phase: use a profiler to get a set of Pareto-optimal configurations (combinations of knobs) in the resource-quality space (with a variant of greedy hill climbing)
- Online Phase: periodically change running queries’ configurations/placement/resource allocation to maximize total utility (quality + lag goals)
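A minimal sketch of the offline phase above: profile candidate knob combinations and keep only the Pareto-optimal points in the resource-quality space. The stand-in profiler and the exhaustive enumeration here replace the paper's measured profiles and greedy hill climbing.

```python
from itertools import product

def profile(config):
    """Stand-in profiler: returns (resource_cost, quality) for one knob combination."""
    fps, resolution = config
    return fps * resolution / 100.0, 0.3 + 0.0005 * resolution + 0.005 * fps

def pareto_frontier(configs):
    points = [(cfg, *profile(cfg)) for cfg in configs]
    frontier = []
    for cfg, cost, quality in points:
        dominated = any(c2 <= cost and q2 >= quality and (c2, q2) != (cost, quality)
                        for _, c2, q2 in points)
        if not dominated:
            frontier.append((cfg, cost, quality))
    return sorted(frontier, key=lambda t: t[1])

knobs = {"fps": [1, 5, 30], "resolution": [240, 480, 1080]}
configs = list(product(knobs["fps"], knobs["resolution"]))
for cfg, cost, quality in pareto_frontier(configs):
    print(cfg, f"cost={cost:.1f}", f"quality={quality:.2f}")
```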
- Use an edge server as a cache with compute resources (similar to a CDN)
- Neurosurgeon: collaborative intelligence between the cloud and the mobile edge - Kang et al., ASPLOS '17 [Morning Paper Summary]
- Observed that (1) data transfer latency is often higher than mobile computation latency, especially on wireless networks, and (2) within a model, intermediate data size shrinks in the front-end layers whereas per-layer latency is higher in the back-end layers
- NOTE: (2) isn't necessarily true for recent networks with global average pooling
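These observations suggest picking a partition point that minimizes mobile compute up to the split, plus the transfer of that layer's output, plus cloud compute for the rest. A minimal sketch with invented per-layer numbers (not measurements from the paper):

```python
# (mobile_latency_ms, cloud_latency_ms, output_size_kb) per layer, front to back.
LAYERS = [
    (10.0, 1.0, 800.0),   # early conv layer: cheap to run, but large activation
    (25.0, 2.5, 400.0),
    (30.0, 3.0, 100.0),
    (40.0, 4.0,  20.0),   # back-end layer: heavier compute, tiny activation
]
INPUT_SIZE_KB = 1500.0
BANDWIDTH_KB_PER_MS = 50.0  # uplink bandwidth

def end_to_end_latency(split):
    """Latency if layers [0, split) run on the device and layers [split, N) run in the cloud."""
    mobile = sum(layer[0] for layer in LAYERS[:split])
    cloud = sum(layer[1] for layer in LAYERS[split:])
    transferred_kb = INPUT_SIZE_KB if split == 0 else LAYERS[split - 1][2]
    return mobile + transferred_kb / BANDWIDTH_KB_PER_MS + cloud

best_split = min(range(len(LAYERS) + 1), key=end_to_end_latency)
print(best_split, round(end_to_end_latency(best_split), 1))  # here the best split runs only the first layer on the device
```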
- Store videos as tables optimized for frame sampling on compressed videos
- Express frame operations as dataflow graphs
- Transfer learning → execute common layers only once
- Trade-off: processing more frames with the shared DNN vs. greater per-frame accuracy with a specialized DNN
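A minimal sketch of that sharing: run the frozen common layers once per frame and feed the result to each application's small specialized head. The "stem" and "heads" are random linear maps, purely illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)
STEM = rng.standard_normal((3 * 32 * 32, 64))   # shared frozen layers ("common stem")
HEADS = {app: rng.standard_normal((64, 2)) for app in ["pedestrians", "cars", "bikes"]}

def shared_stem(frame):
    """Run the common layers once; every application reuses this feature vector."""
    return np.tanh(frame.reshape(-1) @ STEM)

def app_head(app, features):
    """Per-app specialized layers (cheap compared to the stem)."""
    return int((features @ HEADS[app]).argmax())

frame = rng.random((32, 32, 3)).astype(np.float32)
features = shared_stem(frame)                   # executed once per frame
print({app: app_head(app, features) for app in HEADS})
```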
- Resource-accuracy tradeoff is affected by some persistent characteristics, so we can reuse configurations over time → temporal correlation
- Video cameras with the same characteristics share the same best configurations → cross-camera correlations
- Configuration knobs independently impact accuracy → reduce search space
- Divide cameras into groups → periodically re-profile “leader” videos
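A minimal sketch of how the independence observation shrinks the search space: profile each knob separately while holding the others at a default, drop values that miss an accuracy target, and only search the cross product of the survivors. The knob values and the stand-in accuracy function are illustrative assumptions.

```python
from itertools import product

KNOBS = {
    "resolution": [240, 480, 720, 1080],
    "fps":        [1, 5, 10, 30],
    "detector":   ["tiny", "medium", "full"],
}
DEFAULTS = {"resolution": 1080, "fps": 30, "detector": "full"}
ACCURACY_TARGET = 0.8

def accuracy(cfg):
    """Stand-in profiler; a real system would run the pipeline on sample frames."""
    res = {240: 0.6, 480: 0.8, 720: 0.9, 1080: 1.0}[cfg["resolution"]]
    fps = {1: 0.5, 5: 0.85, 10: 0.95, 30: 1.0}[cfg["fps"]]
    det = {"tiny": 0.7, "medium": 0.9, "full": 1.0}[cfg["detector"]]
    return res * fps * det

def prune_knob(name):
    """Profile this knob alone (others at defaults) and keep only values meeting the target."""
    return [value for value in KNOBS[name]
            if accuracy(dict(DEFAULTS, **{name: value})) >= ACCURACY_TARGET]

pruned = {name: prune_knob(name) for name in KNOBS}
search_space = [dict(zip(pruned, values)) for values in product(*pruned.values())]
print(len(search_space), "configs to search instead of", len(list(product(*KNOBS.values()))))
```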
- Objective: low-latency and high-accuracy stream processing over the WAN
- Ask programmers to write degradation functions and profile those configurations
- Adaptively change the configuration at runtime → react to congestion
- Enable low-latency and low-cost querying over large historical video datasets.
- At ingest time: classify objects using a cheap CNN, cluster similar objects (kNN search), and index each cluster using the top-K most confident classification results.
- At query time: look up the ingest index for cluster centroids that match the queried class and classify them using an expensive CNN.
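A minimal sketch of that ingest/query split, with clustering reduced to exact feature matching and both classifiers replaced by toy stand-ins (the bullets above describe the real clustering and indexing).

```python
from collections import defaultdict

def ingest(objects, cheap_classifier, k=2):
    """objects: list of (object_id, feature). Returns (clusters, class -> cluster keys)."""
    clusters = defaultdict(list)   # feature -> object ids (toy "clustering" by exact match)
    index = defaultdict(set)       # class -> cluster keys
    for obj_id, feature in objects:
        clusters[feature].append(obj_id)
        for cls in cheap_classifier(feature)[:k]:   # top-K labels from the cheap CNN stand-in
            index[cls].add(feature)
    return clusters, index

def query(cls, clusters, index, expensive_classifier):
    """Return object ids whose cluster centroid the expensive model confirms as cls."""
    results = []
    for feature in index.get(cls, ()):
        if expensive_classifier(feature) == cls:
            results.extend(clusters[feature])
    return results

# Toy usage: features are strings; the cheap model over-approximates, the expensive one decides.
cheap = lambda f: ["car", "truck"] if "vehicle" in f else ["person", "bike"]
expensive = lambda f: "car" if f == "vehicle_small" else ("truck" if "vehicle" in f else "person")
clusters, index = ingest([(1, "vehicle_small"), (2, "vehicle_large"), (3, "pedestrian")], cheap)
print(query("car", clusters, index, expensive))  # -> [1]
```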
- On-Demand Deep Model Compression for Mobile Devices: A Usage-Driven Model Selection Framework - Liu et al., MobiSys '18
- Adaptively select DNN compression techniques based on user demand (accuracy / storage / computation cost / latency / energy)
- Novel straggler mitigation strategy
- Potluck: Cross-Application Approximate Deduplication for Computation-Intensive Mobile Applications - Guo et al., ASPLOS '18
- Objective: indexing and query optimization for VDMS (for complex queries like joins)
- A novel model for encoding, indexing and storing lineage
- Proposes a new “camera cluster” abstraction
- Saving computing resources
- Resource Pooling
- Improving analytics quality
- Hiding low-level intricacies
- Cracking open the DNN black-box: Video Analytics with DNNs across the Camera-Cloud Boundary - Emmons et al., HotEdgeVideo '19
- Split-brain inference
- Assumption: relevant events are rare.
- Filter frames using a micro binary classifier that takes feature maps from the base DNN as input
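A minimal sketch of that filtering step: a tiny binary "micro-classifier" scores feature maps produced by a shared base DNN and gates which frames are sent. The base network and classifier weights below are random stand-ins, not a trained model.

```python
import numpy as np

rng = np.random.default_rng(0)
W_BASE = rng.standard_normal((3, 8))   # fake "base DNN": maps 3 input channels to 8 feature channels
W_MICRO = rng.standard_normal(8)       # micro-classifier weights over pooled features
SEND_THRESHOLD = 0.5

def base_feature_map(frame):
    """frame: HxWx3 uint8 array -> HxWx8 feature map (stand-in for a real backbone)."""
    return (frame.astype(np.float32) / 255.0) @ W_BASE

def micro_classifier(feature_map):
    """Global-average-pool the feature map, then apply a logistic 'relevance' score."""
    pooled = feature_map.mean(axis=(0, 1))                    # shape (8,)
    return 1.0 / (1.0 + np.exp(-float(pooled @ W_MICRO)))

def should_send(frame):
    return micro_classifier(base_feature_map(frame)) >= SEND_THRESHOLD

frame = rng.integers(0, 256, size=(32, 32, 3), dtype=np.uint8)
print(should_send(frame))
```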
- Down-sampling images is sometimes beneficial for accuracy (e.g., it can remove background noise)
- Adaptively scale video to improve both the speed and accuracy of object detectors
- Dynamic RoI Encoding: decrease the encoding quality of uninteresting areas (using the last processed frame as a heuristic)
- (Dependency-aware) parallel streaming and inference: divide frames into slices and parallelize processing across slices
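A minimal sketch of the dynamic RoI encoding idea: use the previous frame's detections to build a per-macroblock quality map, keeping likely-object regions at high quality and compressing the rest harder. The macroblock size and QP values are illustrative assumptions.

```python
import numpy as np

MB = 16                                   # macroblock size in pixels
QP_HIGH_QUALITY, QP_LOW_QUALITY = 20, 40  # lower QP = higher encoding quality

def roi_qp_map(frame_h, frame_w, last_detections):
    """last_detections: (x, y, w, h) boxes from the previously processed frame."""
    mb_rows, mb_cols = frame_h // MB, frame_w // MB
    qp = np.full((mb_rows, mb_cols), QP_LOW_QUALITY, dtype=np.int32)
    for x, y, w, h in last_detections:
        r0, r1 = y // MB, min(mb_rows, (y + h - 1) // MB + 1)
        c0, c1 = x // MB, min(mb_cols, (x + w - 1) // MB + 1)
        qp[r0:r1, c0:c1] = QP_HIGH_QUALITY   # keep likely-object macroblocks sharp
    return qp

qp = roi_qp_map(720, 1280, last_detections=[(100, 200, 64, 48)])
print(qp.shape, int((qp == QP_HIGH_QUALITY).sum()), "high-quality macroblocks")
```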
- Leverage cross-camera correlations to reduce resource usage and achieve higher inference accuracy
- An auto-generated benchmark that evaluates the performance of VDBMSs
- Lets users place an arbitrary number of cameras, each with configurable position, resolution, and field of view
- Composite queries and automatically generated ground truth labels
- Objective: support (approximate) aggregate and limit queries over large video datasets
- At ingest time, run object detection on a small sample of frames and store the results
- For each query, use the stored detections to train a query-specific proxy model
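A minimal sketch of answering an approximate aggregate (average objects per frame) with a query-specific proxy: fit a cheap linear proxy on the frames the detector labeled at ingest, predict on all frames, and correct with the gap observed on the labeled sample. The synthetic data and the simple difference estimator only loosely mirror the statistical machinery real systems use.

```python
import numpy as np

rng = np.random.default_rng(1)
N_FRAMES, N_LABELED = 10_000, 200

features = rng.normal(size=(N_FRAMES, 4))   # cheap per-frame features available for every frame
true_counts = np.clip(np.round(2 + features @ np.array([1.0, 0.5, 0.0, 0.0])
                               + rng.normal(scale=0.5, size=N_FRAMES)), 0, None)

# Ingest time: run the expensive detector only on a small random sample of frames.
labeled = rng.choice(N_FRAMES, size=N_LABELED, replace=False)
detector_counts = true_counts[labeled]      # stand-in for stored detector output

# Query time: fit a query-specific linear proxy on the labeled sample ...
X = np.column_stack([features[labeled], np.ones(N_LABELED)])
w, *_ = np.linalg.lstsq(X, detector_counts, rcond=None)
proxy_counts = np.column_stack([features, np.ones(N_FRAMES)]) @ w

# ... then answer the aggregate from proxy predictions over all frames, corrected by
# the detector-vs-proxy gap observed on the labeled sample (a simple difference estimator).
correction = np.mean(detector_counts - proxy_counts[labeled])
estimate = proxy_counts.mean() + correction
print(f"estimate={estimate:.3f}  ground_truth={true_counts.mean():.3f}")
```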
- Iterative video processing driven by server-side DNN
- Reducto: On-Camera Filtering for Resource-Efficient Real-Time Video Analytics - Li et al., SIGCOMM '20
- Dynamically adapts filtering decisions based on feature type, threshold, etc.
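A minimal sketch of on-camera filtering in the spirit of the bullet above: the camera computes a cheap per-frame difference feature and ships only frames whose change exceeds a threshold that the server can update over time. The pixel-fraction feature and the feedback hook are illustrative assumptions.

```python
import numpy as np

class OnCameraFilter:
    def __init__(self, threshold=0.05):
        self.threshold = threshold
        self.prev_frame = None

    def feature_diff(self, frame):
        """Cheap low-level feature: fraction of pixels that changed noticeably."""
        if self.prev_frame is None:
            return 1.0
        changed = np.abs(frame.astype(np.int16) - self.prev_frame.astype(np.int16)) > 20
        return float(changed.mean())

    def should_send(self, frame):
        send = self.feature_diff(frame) > self.threshold
        self.prev_frame = frame
        return send

    def update_threshold(self, new_threshold):
        """Called when the server re-profiles feature/threshold choices and pushes an update."""
        self.threshold = new_threshold

cam = OnCameraFilter()
frames = [np.zeros((8, 8), np.uint8), np.zeros((8, 8), np.uint8), np.full((8, 8), 200, np.uint8)]
print([cam.should_send(f) for f in frames])  # -> [True, False, True]
```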
- A system that lets users generalize to unbounded vocabularies without manual retraining