Intelligent temporal subsampling of American Sign Language using event boundaries

J Exp Psychol Hum Percept Perform. 1990 May;16(2):282-94. doi: 10.1037//0096-1523.16.2.282.

Abstract

How well can a sequence of frames be represented by a subset of the frames? Video sequences of American Sign Language (ASL) were investigated in two modes: dynamic (ordinary video) and static (frames printed side by side on the display). An activity index was used to choose critical frames at event boundaries, that is, times when the difference between successive frames is at a local minimum. Sign intelligibility was measured for 32 experienced ASL signers who viewed individual signs. For full gray-scale dynamic signs, activity-index subsampling yielded sequences that were significantly more intelligible than sequences obtained by choosing every mth frame at regular intervals. This advantage was even more pronounced for static images. For binary images, the relative advantage of activity subsampling was smaller. We conclude that event boundaries can be defined computationally and that subsampling at event boundaries yields more intelligible sequences than subsampling at regular intervals.
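The abstract describes the selection rule only informally. The sketch below is one plausible reading of it, assuming a stack of gray-scale frames and using the mean absolute pixel difference between successive frames as the activity index; the paper's exact metric, any smoothing, and the frame budget are not specified here, and the function names (activity_index, event_boundary_frames, every_mth_frames) are illustrative rather than taken from the paper.

    import numpy as np

    def activity_index(frames):
        """Per-frame activity: mean absolute pixel difference from the previous
        frame (a plausible stand-in for the paper's activity index)."""
        frames = np.asarray(frames, dtype=float)
        diffs = np.abs(np.diff(frames, axis=0))           # shape: (n-1, H, W)
        return diffs.reshape(diffs.shape[0], -1).mean(axis=1)

    def event_boundary_frames(frames):
        """Indices of frames at local minima of the activity index, i.e. moments
        when the image is changing least (candidate event boundaries)."""
        a = activity_index(frames)                        # a[t] = change entering frame t+1
        boundaries = []
        for t in range(1, len(a) - 1):
            if a[t] < a[t - 1] and a[t] < a[t + 1]:
                boundaries.append(t + 1)                  # keep the frame after the quiet transition
        return boundaries

    def every_mth_frames(n_frames, m):
        """Baseline condition: regular subsampling of every mth frame."""
        return list(range(0, n_frames, m))

    # Example with synthetic data: 60 random 64x64 gray-scale frames.
    frames = np.random.rand(60, 64, 64)
    print(event_boundary_frames(frames))                  # frames at activity minima
    print(every_mth_frames(60, 6))                        # regular-interval baseline

Selecting local minima of the activity index corresponds to the intuition in the abstract: held or slowly changing poses mark the boundaries between movement events, and those frames carry the most recoverable information about the sign.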

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Adolescent
  • Adult
  • Attention*
  • Concept Formation*
  • Humans
  • Image Processing, Computer-Assisted
  • Manual Communication*
  • Middle Aged
  • Sign Language*
  • Video Recording
  • Visual Perception*