INTRODUCTION (1 week)

      Overview of the class

      Intro to Computational Perception

      • ``2001: HAL's Legacy'', PBS Show. The documentary was produced by David Kennard and Michael O'Connell (InCA Productions) and funded by the Alfred P. Sloan Foundation.
      • Rosenfeld, A. (1997). ``Eyes for Computers: How HAL could see?'', Chapter 10 in ``HAL's Legacy, 2001's Computer as Dream and Reality'', Stork, D. (Editor), MIT Press.
      • Irfan A. Essa (1999). ``Computers Seeing People'', AI Magazine 20(2): pp. 69-82.
      • (Optional reading) David Stork (1998). ``HAL's Legacy: 2001's computer as dream and reality'', MIT Press.


    TUTORIALS AND BACKGROUND MATERIAL (1 week)

      Alex's Matlab Tutorial

      OpenCV Tutorial

      Speech Recognition Packages Tutorial

      Review of Probability and Linear Algebra


    BASIC IMAGE PROCESSING (2 weeks)

      Mathematical Morphology

      • Jain, Kasturi, and Schunck (1995). Machine Vision, ``Chapter 2: Binary Image Processing,'' McGraw-Hill, pp. 25-72.
      • Haralick and Shapiro (1993). Computer and Robot Vision, "Chapter 5: Mathematical Morphology," Addison-Wesley.

      Image Filtering

      • Jain, Kasturi, and Schunck (1995). Machine Vision, ``Chapter 4: Image Filtering,'' McGraw-Hill, pp. 112-139.
      • Burt and Adelson (1983). ``The Laplacian Pyramid as a Compact Image Code,'' IEEE Transactions on Communications, vol. 31(4), pp. 532-540.


    COLOR AND MOVEMENT (1 week)

      Color and Skin detection

      • Yang, Lu, and Waibel (1997). ``Skin-color modeling and adaptation'', CMU-CS-97-146, May 1997.

      Motion Energy and Motion History

      • A. F. Bobick and J.W. Davis. ``An apearance-based representation of action''. In Proceedings of IEEE International Conference on Pattern Recognition 1996, August 1996, pp. 307-312.
      • Davis, J. and A. Bobick (1997). ``The Representation and Recognition of Action Using Temporal Templates'', In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, June 1997, pp. 928-934.

      Applications

      • J. Yang, W. Lu, and A. Waibel (1998). ``A real time face tracker''. In Proceedings of Asian Conference on Computer Vision (ACCV), volume 2, pp. 687-694.
      • A. Bobick, S. Intille, J. Davis, F. Baird, C. Pinhanez, L. Campbell, Y. Ivanov, A. Schutte, and A. Wilson (1999). ``The Kidsroom: A Perceptually-Based Interactive and Immersive Story Environment", Presence: Teleoperators and Virtual Environments, Vol. 8, No. 4, 1999, pp. 367-391.
      • J. Davis and A. Bobick (1998). ``Virtual PAT: A Virtual Personal Aerobics Trainer'', Workshop on Perceptual User Interfaces, November 1998, pp. 13-18.


    FACE DETECTION AND RECOGNITION (1 week)

      Eigenfaces

      • M. Turk and A. Pentland (1991). ``Eigenfaces for recognition''. Journal of Cognitive Neuroscience, 3(1).
      • Dana H. Ballard (1999). ``An Introduction to Natural Computation (Complex Adaptive Systems)'', Chapter 4, pp 70-94, MIT Press.

      Neural Network-Based Approaches

      • Henry A. Rowley, Shumeet Baluja and Takeo Kanade (1997). ``Rotation Invariant Neural Network-Based Face Detection,'' Carnegie Mellon Technical Report, CMU-CS-97-201.

      Cascades

      • Paul Viola and Michael Jones (2001). ``Robust Real-time Object Detection'', Second International Workshop on Statistical and Computational Theories of Vision Modeling, Learning, Computing, and Sampling, Vancouver, Canada, July 13, 2001.


    THE SENSE OF SELF (1 week)

      Phantoms in the Brain

      • Ramachandran, V.S. and S. Blakeslee (1998). "Phantoms in the Brain: Probing the Mysteries of the Human Mind", William Morrow, New York. pp. 1-62.
      • Melzack, R. (1992). "Phantom Limbs", Scientific American, 266, April, pp. 120-126.

      Sensory Substitution

      • New Scientist (2005). Cover story: "Why you have at least 21 senses", January 29, pp. 33-43.
      • Andy Clark, (2003). "Who are we?", Ch. 5 in Natural-Born Cyborgs: Minds, Technologies, and the Future of Human Intelligence, Oxford University Press.
      • P. Bach-y-Rita, C. C. Collins, F. Sauders, B. White, and L. Scadden, (1969), ``Vision substitution by tactile image projection''. Nature, 221, pp. 963-964.
      • Paul Bach-y-Rita and Stephen W. Kercel (2003). ''Sensory substitution and the human-machine interface'' , Trends Cogn Sci, Dec;7(12):541-6.


    PRELIMINARY PROJECT PRESENTATIONS (1 week)


    TOPIC TO BE DETERMINED (1 week)


    SPRING BREAK (1 week)


    TRACKING TECHNIQUES (1.5 weeks)

      Kalman Filter

      • Maybeck, Peter S. (1979). Chapter 1 in ``Stochastic models, estimation, and control'',Mathematics in Science and Engineering Series, Academic Press.
      • Greg Welch and Gary Bishop (2001). SIGGRAPH 2001 Course: ``An Introduction to the Kalman Filter''.

      Particle Filters

      • F. Dellaert, D. Fox, W. Burgard, and S. Thrun (1999). "Monte Carlo Localization for Mobile Robots", IEEE International Conference on Robotics and Automation (ICRA99), May, 1999.
      • Ioannis Rekleitis (2004). A Particle Filter Tutorial for Mobile Robot Localization. Technical Report TR-CIM-04-02, Centre for Intelligent Machines, McGill University, Montreal, Quebec, Canada.
      • Michael Isard and Andrew Blake (1998). ``CONDENSATION -- conditional density propagation for visual tracking'', International Journal of Computer Vision, 29, 1, 5--28.


    HIDDEN MARKOV MODELS (1 week)

      Theory

      • Rabiner, Lawrence, and Juang (1993). ``Theory and Implementation of Hidden Markov Models'', Chapter 6 in Fundamentals of Speech Recognition, Prentice-Hall, pp. 321-389.

      Applications

      • Thad Starner and Alex Pentland (1996) "Real-Time American Sign Language Recognition from Video Using Hidden Markov Models" PAMI July 1997.
      • Tanawongsuwan, R., Stoytchev, A., and Essa, I. (1999). "Robust Tracking of People by a Mobile Robotic Agent", Technical Report GIT-GVU-99-19.
      • Stefan Waldherr, Roseli Romero, Sebastian Thrun (2000). ``A Gesture Based Interface for Human-Robot Interaction'', Autonomous Robots, Volume 9, Issue 2, September 2000, pp. 151 - 173.


    WHAT IS INTELLIGENCE? (1.5 week)

      Theories of Vision

      • J. K. O'Regan and A. Noe, (2001). ``A sensorimotor account of vision and visual consciousness'', Behavioral and Brain Sciences, 24(5), 939- 1011.

      What is Intelligence

      • Jeff Hawkins and Sandra Blakeslee, "On Intelligence: How a New Understanding of the Brain Will Lead to the Creation of Truly Intelligent Machines", Henry Holt, pp. 138-235, 2004.


    AFFECTIVE COMPUTING (1 week)

      • Rosalind W. Picard (1997). ``Affective Computing'', MIT Press.
      • Rosalind W. Picard (1995). ``Affective Computing'', MIT Media Lab TR-321, November 1995 (abbreviated version of the book).
      • A. R. Demasio (1994). ``Descartes' Error: Emotion, Reason and the Human Brain'',New York: Gosset/Putnam Press (excerpt).


    FINAL PROJECT PRESENTATIONS (1 week)


    TOTAL: 16 weeks