Deep Learning to See

Towards New Foundations of Computer Vision
Langbeschreibung
The remarkable progress in computer vision over the last few years is, by and large, attributed to deep learning, fueled by the availability of huge sets of labeled data, and paired with the explosive growth of the GPU paradigm. While subscribing to this view, this work criticizes the supposed scientific progress in the field, and proposes the investigation of vision within the framework of information-based laws of nature.This work poses fundamental questions about vision that remain far from understood, leading the reader on a journey populated by novel challenges resonating with the foundations of machine learning. The central thesis proposed is that for a deeper understanding of visual computational processes, it is necessary to look beyond the applications of general purpose machine learning algorithms, and focus instead on appropriate learning theories that take into account the spatiotemporal nature of the visual signal.Serving to inspire and stimulate critical reflection and discussion, yet requiring no prior advanced technical knowledge, the text can naturally be paired with classic textbooks on computer vision to better frame the current state of the art, open problems, and novel potential solutions. As such, it will be of great benefit to graduate and advanced undergraduate students in computer science, computational neuroscience, physics, and other related disciplines.
Hauptbeschreibung
Presents a curiosity-driven approach, encouraging novel computational models of vision
Inhaltsverzeichnis
1. Introduction.- 2. Cutting the Umbilical Cord with Pattern Recognition.- 3. Spatiotemporal Visual Environments.- 4. Hierarchical Description of Visual Tasks.- 5. Benchmarks and the "En Plein Air" Challenge.
Stefano Melacci is an Associate Professor at the Department of Information Engineering and Mathematics, University of Siena (Siena, Italy), where he received his PhD (2010) and the M.S. Degree (cum Laude) in Computer Engineering. He worked as a Researcher in the Academy and in the Industry (QuestIT S.r.l., Italy), and he has been a Visiting Scientist at the Ohio State University, Columbus (OH), USA. His profile is strongly characterized by research activity in the field of Machine Learning and, more generally, in Artificial Intelligence. Recently, he focussed on the problem of Continuous Learning from Video Streams, studying the role of motion and attention. He contributed to the unifying framework of Learning from Constraints that allows classic learning models to integrate symbolic knowledge representations. Prof. Melacci serves as Associate Editor of the IEEE Transactions on Neural Networks and Learning Systems, and he is an active reviewer for several journals and conferences in the field of Machine Learning.
ISBN-13:
9783030909864
Veröffentl:
2022
Erscheinungsdatum:
27.04.2022
Seiten:
120
Autor:
Alessandro Betti
Gewicht:
195 g
Format:
235x155x7 mm
Serie:
SpringerBriefs in Computer Science
Sprache:
Englisch

58,84 €*

Lieferzeit: Print on Demand - Lieferbar innerhalb von 3-5 Werktageni
Alle Preise inkl. MwSt. | zzgl. Versand