Externally indexed torrent
If you are the original uploader, contact staff to have it moved to your account
Textbook in PDF format
As the first book of a three-part series, this book is offered as a tribute to pioneers in vision, such as Bela Julesz, David Marr, King-Sun Fu, Ulf Grenander, and David Mumford. The authors hope to provide foundation and, perhaps more importantly, further inspiration for continued research in vision. This book covers David Marr's paradigm and various underlying statistical models for vision. The mathematical framework herein integrates three regimes of models (low-, mid-, and high-entropy regimes) and provides foundation for research in visual coding, recognition, and cognition. Concepts are first explained for understanding and then supported by findings in psychology and neuroscience, after which they are established by statistical models and associated learning and inference algorithms. A reader will gain a unified, cross-disciplinary view of research in vision and will accrue knowledge spanning from psychology to neuroscience to statistics.
The primary aim of this book is to pursue knowledge representation for vision. In a world in which technology generates immense amounts of data from a wide spectrum of sources, it is of growing importance to establish a common framework for knowledge representation, learning, and discovery. A current problem in Artificial Intelligence is how to acquire truly massive amounts of knowledge, akin to the faculty of common sense in humans, from raw sensory signals and, moreover, use this knowledge for inference and reasoning. In this book, the vision models presented for acquiring such knowledge are based most fundamentally on statistical properties discovered for natural images over the past several decades. Similar to models in physics, models are advanced based on empirical grounds, such as discovering additional patterns in data. In this way, models stay free of bias.
1 Introduction
2 Statistics of Natural Images
3 Textures
4 Textons
5 Gestalt Laws and Perceptual Organization
6 Primal Sketch: Integrating Textures and Textons
7 2.1D Sketch and Layered Representation
8 2.5D Sketch and Depth Maps
9 Learning by Information Projection
10 Information Scaling
11 Deep Image Models
12 A Tale of Three Families: Discriminative, Descriptive, and Generative Models