Summer Camp

DAY 1 July 16th Sunday	08:30-09:00	Hui Huang	Opening Ceremony
	09:00-10:20	Daniel Cohen-Or	Introduction to Generative Adversarial Networks and their Applications
	10:40-12:00	Yu Qiao	Deeply Understanding Complex Human Activities and Scenes in the Wild
	14:30-15:50	Yizhou Yu	Deep Learning Techniques for Sketch-Based Face Modeling and Automatic Image/Video Stylization
	16:10-17:30	Andrei Sharf	Adaptive Shape Sensing with Neural-Networks
	19:00-21:00	Jiacheng Ren	3ds Max

DAY 2 July 17th Monday	09:00-10:20	Ariel Shamir	3D Fabrication For Everyone
	10:40-12:00	Yue Dong	Data driven appearance modeling
	14:30-15:50	Huamin Qu	Visual Analytics of Big Data
	16:10-17:30	Jin Huang	High Quality Hex Mesh Generation
	19:00-21:00	Jiacheng Ren	3ds Max

DAY 3 July 18th Tuesday	09:00-10:20	Hao Zhang	From Symmetry to Functionality: An Evolution to Understand 3D Shapes
	10:40-12:00	Manolis Savva	Holistic 3D Scene Understanding
	14:30-15:50	Lei Shi	Network Visualization and Iits Application to Academic Influence Analysis
	16:10-17:30	Zhaopeng Cui	Global Structure-from-Motion and Its Application
	19:00-21:00	Jiacheng Ren	3ds Max

DAY 4 July 19th Wednesday	09:00-10:20	Dani Lischinski	Gradient Domain Manipulation
	10:40-12:00	Oliver Deussen	Data Visualization: an Introduction
	14:30-16:30	Ruizhen Hu Pengfei Xu Di Lin Qian Zheng	Introduction on VCC Research and Open Discussion
	16:30-18:00	All	Closing Ceremony

Introduction to Generative Adversarial Networks and their Applications （Daniel Cohen-Or）　

Abstract：In this tutorial talk we will describe Generative Adversarial Networks (GANs). We will start by explaining the motivation and the theory behind these networks, which are considered by some as one of the most important developments in Artificial Intelligence. Next we will survey various extensions and applications of GANs, including the recent exciting cycleGAN/dualGAN architecture.

Deeply Understanding Complex Human Activities and Scenes in the Wild Convexity Analysis （Yu Qiao ）　

Abstract：Human activities and scenes understanding is receiving extensive research interests in computer vision nowadays due to its wide applications in surveillance, human-computer interface, sports video analysis, and content based video retrieval. The challenges of action and scene recognition come from background clutter, viewpoint changes, and motion and appearance variations. In this lecture, I will report our continuous efforts (CVPR13,14,15,16, ICCV13, ECCV 14, 15, TIP14, TIP 17, IJCV 16) to address these challenges. These works range from mining middle level parts, multi-view encoding of local descriptors, hierarchical model, to utilizing deep models for action recognition and detection. Experimental results on large public datasets (e.g. UCF101, HMDB51, SUN, LSUN) demonstrate the effectiveness of the proposed methods.

Deep Learning Techniques for Sketch-Based Face Modeling and Automatic Image/Video Stylization （Yizhou Yu）　

Abstract：Face modeling has been paid much attention in the field of visual computing. There exist many scenarios, including cartoon characters, avatars for social media, 3D face caricatures as well as face-related art and design, where low-cost interactive face modeling is a popular approach especially among amateur users. In this lecture, I present a deep learning based sketching system for 3D face and caricature modeling. This system has a labor-efficient sketching interface, that allows the user to draw freehand imprecise yet expressive 2D lines representing the contours of facial features. A novel CNN based deep regression network is designed for inferring 3D face models from 2D sketches. Our system also supports gesture based interactions for users to further manipulate initial face models. Both user studies and numerical results indicate that our sketching system can help users create face models quickly and effectively. A significantly expanded face database with diverse identities, expressions and levels of exaggeration is constructed to promote further research and evaluation of face modeling techniques. Color and tone stylization in images and videos strives to enhance unique themes with artistic color and tone adjustments. It has a broad range of applications from professional image postprocessing to photo sharing over social networks. I also present a novel deep learning architecture for exemplar-based image stylization, which learns local enhancement styles from image pairs. Image stylization can be efficiently accomplished with a single forward pass through our deep network. To extend our deep network from image stylization to video stylization, we exploit temporal superpixels (TSPs) to facilitate the transfer of artistic styles from image exemplars to videos. Experiments on a number of datasets for image stylization as well as a diverse set of video clips demonstrate the effectiveness of our deep learning architecture.

Deeply Understanding Complex Human Activities and Scenes in the Wild Convexity Analysis （Andrei Sharf ）　

Abstract：Convolutional neural networks (CNNs) require data from a regular domain, which prohibits direct processing of geometric data that is irregular by nature, such as point clouds and meshes. We present an approach for point-cloud analysis using a neural network that samples geometric information and maps it to a representation suitable for CNN consumption. A sensor, composed of a regular grid of probes embedded in space, samples vector-field values induced by the geometry. The captured information can then be readily processed by a CNN, extracting high-level features. Having the sensor module as an integral part of the network enables an iterative scheme in which the net applies multiple geometric transformations to both the sensor and the shape. We formulate several variations of the sensor module and discuss their merits, describe approaches to a few shape-analysis tasks using this framework, reason about its scalability, and present preliminary experiments and results on shape detection and classification.

3D Fabrication For Everyone （Ariel Shamir ）　

Abstract：With the progress of 3D printing and other fabrication techniques, the creation of real 3D object has become more accessible. This allowed the emergence of a new and exciting "makers movement".However, if designing a 3D object for virtual world (movies, games) was a difficult task for non-experts, then designing 3D objects that need to be manufactured and function is even more challenging. In this talk, I will describe several solutions that try to meet this challenge. One solution allows novice users to create new objects by taking parts from existing designs. Others allows the creation of connectors between objects, or designing foldable objects automatically. In all cases, behind the scene the system takes care of tedious and difficult tasks such as connecting constraints and physical feasibility.

Data driven appearance modeling （Yue Dong ）　

Abstract：Digitally reproducing the visual richness of the appearance of real-world materials is a challenging and important problem in computer graphics. In this talk, I will introduce several works in data driven appearance modeling and acquisitions. The main axis of those works are data coherency, I will discuss how different data coherency leads to various solutions and the relationship between those methods. Finally, I will also present our latest work that adapting deep learning techniques for appearance modeling.

Visual Analytics of Big Data （Huamin Qu ）　

Abstract：Big data is large and complex. Researchers have developed advanced data mining and machine learning techniques to reveal patterns in big data. However, people without a mathematics or computer science background may find these methods and findings difficult to understand. Also not all problems can be solved by automatic methods. Data visualization, which turns data into visual forms and keep human in the analytical loop, is widely considered a key to big data analytics. In this talk, I will first introduce the opportunities and challenges for visual analytics of big data. I will then present ongoing research projects at HKUST and demonstrate how visualization helps reveal rumor propagation on social media such as Twitter and WeChat, learning behaviors of students on MOOC platform, and human mobility patterns based on mobile phone and transportation data.

High Quality Hex Mesh Generation （Jin Huang）　

Abstract：Mesh is the most popular discretization method in many fields. What is a good mesh and how to get a good mesh are very basic problems. In this talk, I will briefly introduce some basic knowledge about them, then focus on our progress of answering these two questions. First, we developed a very efficient method of estimating condition number, the numerical stability indicator of a mesh. Second, we show how to address the key challenges in hex remeshing, the “holy grail” problem in remeshing, and discuss the possible future works.

From Symmetry to Functionality: An Evolution to Understand 3D Shapes （Hao Zhang）　

Abstract：Symmetry is ubiquitous in nature and man-made artifacts. In this talk, I will first show how symmetry analysis and organization can play a crucial role in understanding a 3D shape in a hierarchical way. In particular, it may seem surprising that one can infer the generative history of a pattern by analyzing its symmetry alone. Furthermore, I will show how a symmetry-induced hierarchical representation is the key to allow a machine to learn a generative and structure-aware model for 3D shapes. With an intimate connection between symmetry and functionality (e.g., symmetric parts in a 3D object tend to perform the same function), symmetry provides a first cue to understand object functionality, but the key missing piece is how an object interacts with its environment to perform its functions. In the second part of my talk, I will cover our recent works on functional understanding of 3D shapes, evolving from analyses in a static setting to inferring part motion from a single static snapshot.

Holistic 3D Scene Understanding （Manolis Savv ）　

Abstract：The tremendous progress of RGBD sensing technology and recent growth of organized 3D model repositories has led to a deluge of data that captures and represents the structure and semantics of real-world scenes. With this increasing availability of 3D scene data, research challenges are shifting towards analysis and high-level understanding of scene semantics.This talk will describe recent work exploring 3D scene understanding along three axes: i) large-scale acquisition and semantic labeling of RGBD reconstructions, ii) leveraging large-scale virtual 3D scene data to jointly predict the volumetric occupancy and semantics of a scene from single RGBD frames, and iii) investigating the impact of different rendering methods when generating data from virtual scenes to train deep learning methods for tasks such as normal prediction, semantic segmentation, and edge prediction.

Network visualization and its application to academic influence analysis （Lei Shi ）　

Abstract：Visually analyzing academic influence graphs, such as those in the citation network, poses challenge to many fields of the computer science research. How can we summarize a large graph according to user's interest? In particular, how can we illustrate the impact of a highly influential paper through the summarization? Can we maintain the sensory node-link visual metaphor while preserving both the influence flow patterns and fine readability? How to visualize the evolutionary pattern as the influence propagates? This talk starts from basics on network visualization, including the design rationale, various layout considerations and evaluation. After that, we demonstrate the visualization and data mining techniques developed in recent years to solve academic influence graph analysis problem.

Global Structure-from-Motion and Its Application （Zhaopeng Cui ）　

Abstract：Structure-from-motion (SfM) is a fundamental problem in 3D computer vision, with the aim of recovering camera poses and 3D scene structure simultaneously given a set of 2D images. SfM methods can be broadly divided into incremental and global methods according to their ways to register cameras. Incremental methods register cameras one by one, while global SfM methods solve all cameras simultaneously from all available relative motions. As a result, global SfM has better potential in both reconstruction accuracy and computation efficiency than incremental SfM. In this talk, I will first present our latest global SfM system which could deal with all kinds of camera motions and datasets. Then I will present our new robust video alignment algorithm based on global SfM, which can align videos taken at different times with substantially different appearances, in the presence of moving objects and moving cameras with slightly different trajectories.

Gradient Domain Manipulation （Dani Lischinski ）　

Abstract：Gradient domain manipulation is an important technique with multiple applications in image and video tone mapping and editing. In this talk, I will explain the relevant underlying mathematical theory and show how operating in the gradient domain can be used for tone mapping of High Dynamic Range (HDR) images, as well as for seamless image cloning and stitching.

Data Visualization: an Introduction （Oliver Deussen ）　

Abstract：In data visualization large amounts of data are converted into images in order to use the human eye to find interesting patterns that can later be analysed by data mining techniques. In my talk I will give brief overview about various data visualization techniques and will mention the most important research questions of this field.

Style and Functionality Analysis of 3D Shapes （Ruizhen Hu ）　

Abstract：The majority of man-made objects are designed to serve a certain function and process a certain style, and they are often reflected by the geometry of the objects. In recent years, many efforts in shape analysis have developed methods that extract high-level structural and semantic information from geometric shapes and scenes, especially involving man-made objects. Style and functionality have been receiving increasingly more attention in shape analysis and geometric modeling. In this talk, I will talk about recent developments that incorporate style and functionality aspects in the analysis of 3D shape.

Learning Image Representation for Classification and Semantic Segmentation （Di Lin ）　

Abstract：Image classification and semantic segmentation have been long standing topics in high-level computer vision. They emphasize understanding of scene context and object. Towards image classification, we address the problem regarding appropriate representation of the scene structure and differentiation among object categories. Compared to image classification, semantic segmentation task faces more challenges. Not only the global information of the image, but also the underlying relationship among pixels need to be captured by the image representation. Therefore large-scale data is vital to enable reliable learning of representation. We propose a weakly-supervised learning system for semantic segmentation, which can be used in practice to improve segmentation with cost-effective data.

Pattern-Aware Arrangement of Graphic Elements （Pengfei Xu ）　

Abstract：Arrangement of graphic elements is a fundamental task in many text and graphic editing scenarios. Traditional interactive techniques such as snapping and arrangement commands are provided by most graphic editors to aid the users in accomplishing this task. However, these techniques often ignore the underlying patterns of the elements, requiring the users to perform explicit and tedious operations to achieve the goal of the manipulation. It is expected that the amount of user interaction can be reduced by exploiting the patterns of the elements. Nevertheless, designing effective pattern-aware interaction tools is challenging due to the ambiguities in the patterns presented by the elements and the operations issued by the user. In this talk, I will introduce two effective pattern-aware interaction techniques to aid the users in the arrangement of elements. In particular, I will introduce 1) GACA, a group-aware command-based arrangement tool for arranging multiple groups of elements with a single command, and 2) A framework for automatic global beautification of layouts of graphic elements with gestural interface for editing the patterns of the elements. These two techniques adopt different strategies to resolve the ambiguities that may arise during manipulation, ensuring their usability.

Capture and Reconstruction of Blooming Flowers （Qian Zheng ）　

Abstract：Advances in affordable three-dimensional (3D) acquisition devices now provide new opportunities in capturing and modelling 3D real-life phenomena in time. In my talk I will present our recent work on capturing and reconstructing spatially and temporally coherent sequences of blooming flowers, and talk about some common techniques for registration and tracking.

Course Abstract

Introduction to Generative Adversarial Networks and their Applications （Daniel Cohen-Or）

Deeply Understanding Complex Human Activities and Scenes in the Wild Convexity Analysis （Yu Qiao）

Deep Learning Techniques for Sketch-Based Face Modeling and Automatic Image/Video Stylization （Yizhou Yu）

Deeply Understanding Complex Human Activities and Scenes in the Wild Convexity Analysis （Andrei Sharf）

3D Fabrication For Everyone （Ariel Shamir）

Data driven appearance modeling （Yue Dong）

Visual Analytics of Big Data （Huamin Qu）

High Quality Hex Mesh Generation （Jin Huang）

From Symmetry to Functionality: An Evolution to Understand 3D Shapes （Hao Zhang）

Holistic 3D Scene Understanding （Manolis Savv）

Network visualization and its application to academic influence analysis （Lei Shi）

Global Structure-from-Motion and Its Application （Zhaopeng Cui）

Gradient Domain Manipulation （Dani Lischinski）

Data Visualization: an Introduction （Oliver Deussen）

Style and Functionality Analysis of 3D Shapes （Ruizhen Hu）

Learning Image Representation for Classification and Semantic Segmentation （Di Lin）

Pattern-Aware Arrangement of Graphic Elements （Pengfei Xu）

Capture and Reconstruction of Blooming Flowers （Qian Zheng）

Introduction to Generative Adversarial Networks and their Applications （Daniel Cohen-Or）　

Deeply Understanding Complex Human Activities and Scenes in the Wild Convexity Analysis （Yu Qiao ）　

Deep Learning Techniques for Sketch-Based Face Modeling and Automatic Image/Video Stylization （Yizhou Yu）　

Deeply Understanding Complex Human Activities and Scenes in the Wild Convexity Analysis （Andrei Sharf ）　

3D Fabrication For Everyone （Ariel Shamir ）　

Data driven appearance modeling （Yue Dong ）　

Visual Analytics of Big Data （Huamin Qu ）　

High Quality Hex Mesh Generation （Jin Huang）　

From Symmetry to Functionality: An Evolution to Understand 3D Shapes （Hao Zhang）　

Holistic 3D Scene Understanding （Manolis Savv ）　

Network visualization and its application to academic influence analysis （Lei Shi ）　

Global Structure-from-Motion and Its Application （Zhaopeng Cui ）　

Gradient Domain Manipulation （Dani Lischinski ）　

Data Visualization: an Introduction （Oliver Deussen ）　

Style and Functionality Analysis of 3D Shapes （Ruizhen Hu ）　

Learning Image Representation for Classification and Semantic Segmentation （Di Lin ）　

Pattern-Aware Arrangement of Graphic Elements （Pengfei Xu ）　

Capture and Reconstruction of Blooming Flowers （Qian Zheng ）