Saving data by adding visual knowledge priors to Deep Learning.
Accepted papers
We showcase original works on data-efficient computer vision through live oral talks and a poster session.
Decisions are out.
Accepted posters
We invite researchers to present their recent published works on data-efficient computer vision as a poster.
Decisions are out.
VIPriors challenges
We host four data efficieny challenges on action recognition, classification, detection and segmentation.
Winners will be announced at ECCV.
About the workshop
Data is fueling deep learning, yet it is costly to gather and to annotate. Training on massive datasets has a huge energy consumption adding to our carbon footprint. In addition, there are only a select few deep learning behemoths which have billions of data points and thousands of expensive deep learning hardware GPUs at their disposal. This workshop focuses on how to pre-wire deep networks with generic visual inductive innate knowledge structures, which allows to incorporate hard won existing generic knowledge. Visual inductive priors are data efficient: what is built-in no longer has to be learned, saving valuable training data.
Excellent recent research investigates data efficiency in deep networks by exploiting other data sources through unsupervised learning, re-using existing datasets, or synthesizing artificial training data. However, not enough attention is given on how to overcome the data dependency by adding prior knowledge to deep nets. As a consequence, all knowledge has to be (re-)learned implicitly from data, making deep networks hard to understand black boxes which are susceptible to dataset bias requiring huge datasets and compute resources. This workshop aims to remedy this gap by investigating how to flexibly pre-wire deep networks with generic visual innate knowledge structures, which allows to incorporate hard won existing knowledge from physics such as light reflection or geometry.
The great power of deep neural networks is their incredible flexibility to learn. The direct consequence of such power, is that small datasets can simply be memorized and the network will likely not generalize to unseen data. Regularization aims to prevent such over-fitting by adding constraints to the learning process. Much work is done on regularization of internal network properties and architectures. In this workshop we focus on regularization methods based on innate priors. There is strong evidence that an innate prior benefits deep nets: adding convolution to deep networks yields a convolutional deep neural network (CNN) which is hugely successful and has permeated the entire field. While convolution was initially applied on images, it is now generalized to graph networks, speech, language, 3D data, video, etc. Convolution models translation invariance in images: an object may occur anywhere in the image, and thus instead of learning parameters at each location in the image, convolution allows to only consider local relations, yet, share parameters over all image locations. This allows a strong reduction in both number of parameters and examples to learn from. This workshop aims to further the great success of convolution, exploiting innate regularizing structures yielding a significant reduction of training data.
Program
GMT | IDT (GMT+3) | ||
---|---|---|---|
6:00 | 9:00 | Opening | Challenge winners will be announced. |
6:15 | 9:15 | Invited talk: Stéphane Mallat | “Priors versus Learning for Synthesis and Classification with Deep Networks” |
7:00 | 10:00 | Oral presentations | Accepted orals |
7:30 | 10:30 | Poster session | Accepted posters |
8:30 | 11:30 | Invited talk: Devi Parikh | |
9:15 | 12:15 | Invited talk: Efstratios Gavves | |
10:00 | 13:00 | Closing |
Invited speakers
Organizers
Nergis Tömen
Delft University of Technology
Osman Semih Kayhan
Bosch Security Systems B.V.
Marcos Baptista Ríos
Alice Biometrics S.L.
Lorenzo Brigato
Sapienza University of Rome
Contact
Email us at vipriors-ewi AT tudelft DOT nl