Artificialvision applications, such as object detection in natural images and automaticsegmentation of medical acquisitions, rely on models that interpret the visualinformation provided to a computer. The model provides a compromise between thesupport given by the observations and the prior domain knowledge. This courseis concerned with the two computational problems that arise when using such modelsin practice.
Inference(Energy Minimization):
Given a visual observation (for example, an image or an MRI scan), weare interested in estimating its most likely interpretation (i.e. the location of allthe objects in the image, or the segments of the MRI scan) according to themodel. While the problem cannot be solved optimally, we will describe state of the art approximate algorithms that provide very accurate solutions inpractice. While the theoretical properties of the algorithms will be discussedbriefly, the main emphasis will be on their application.
Learning(Parameter Estimation):
Given a set of training samples consisting of inputs and their desiredoutputs, (for example, images and the location of the objects, or MRI scans andtheir segmentations) we would like to estimate a model that is suited to thetask at hand. We will show how the problem of learning a model can beformulated as empirical risk minimization. Furthermore, we will presentefficient algorithms for solving the corresponding optimization problem.