To improve the process of landmark labeling of training images, this paper proposes a new approach for estimating a set of landmarks for a large image ensemble with only a small number of manually labeled images from the ensemble.
Landmark labeling of training images is essential for many learning tasks in computer vision, such as object detection, tracking, and alignment. Image labeling is typically conducted manually, which is both labor-intensive and error prone. The authors’ approach, named semi-supervised least-squares congealing, aims to minimize an objective function defined on both labeled and unlabeled images. A shape model is learned on-line to constrain the landmark configuration. The process also employs a partitioning strategy to allow coarse-to-fine landmark estimation. Extensive experiments on facial images show that the proposed approach can reliably and accurately label landmarks for a large image ensemble starting from a small number of manually labeled images, under various challenging scenarios. (Publisher abstract provided)
Similar Publications
- Camera-View Augmented Reality: Overlaying Navigation Instructions on a Real-Time View of the Road
- Implementing Frequent Substance Use Testing with Swift-Certain-Fair Sanctions: Stakeholder Insights from a Pilot Program with Volunteer Probationers
- Assessing Triangulation Across Methodologies, Methods, and Stakeholder Groups: The Joys, Woes, and Politics of Interpreting Convergent and Divergent Data