A novel approach which jointly optimizes a canonical time and pose configuration coupled with temporal and cyclic consistency constraints providing high quality outputs as the observed views become sparser.
Emergent communication via mid-level patches in a referential game played on a large-scale image dataset.