The Data Set

Updated on 6/17/2007 3:48 PM

Raw image sequences

 

The data was collected over four days, May 20-21, 2001 and Nov 15-16, 2001 at University of South Florida (USF), Tampa. There are about 33 subjects common between the May and Nov collections. USF IRB approved informed consent was obtained from all the subjects in this dataset. The data set consists of persons walking in elliptical paths in front of the camera(s). Each person walked multiple (>= 5) circuits around an ellipse, out of which the last circuit forms the data set.  The protocol for the data collection, including specifications of the used imaging equipment, is described in detail HERE.  An example AVI file is available HERE.  For each person, we have up to 5 covariates

 

*      2 different shoe types (A, and B),

*      2 different carrying conditions (with or without a briefcase),

*      on 2 different surface types (grass and concrete),

*      from 2 different viewpoints (Left or Right) and

*      some at 2 different time instants

 

Thus, there are 32 possible conditions under which persons gait could have been imaged.  However, not all subjects were imaged in all conditions. The full data set can be partitioned as depicted in the following grid.

*      We refer to the cells with blue background as the May-2001-No-Briefcase data (used in our ICPR-02 and FGR-02 papers). 

*      We distribute only the full version of the dataset which consists of 1870 sequences from 122 subjects. The total size of the data is around 1.2 Tera bytes, uncompressed, and is distributed in compressed form using: one external 750GB (or more) drive. Please make sure your computer system can handle this large amount of data.

*      You will need to have linux to uncompress the data.

 

 

To the following address send the first four listed items. Also send an email, with tracking numbers, to (sarkar AT cseeDOTusfDOTedu) to look out for the disks.

 

            Sudeep Sarkar

                 Computer Science and Engineering

                  University of South Florida

                 4202 E Fowler Ave., ENB 118

                 Tampa, FL 33620

                 USA

 

  1. A copy of the release agreement, signed by a legal representative of your institution.
  2. One EXTERNAL 750 GB drive (firewire or USB) for the Full May-Nov-2001 data. Specify either linux's ext2 or MS VFAT.
  3. Prepaid return label with addresses to which drives will be sent AND a telephone number, for the carrier.
  4. Send an email, with tracking numbers, to (sarkar AT cseeDOTusfDOTedu) to look out for the disks.

 

Note 1: Please make sure your computer setup can handle terabytes of data and you have access to the linux machine to uncompress the data.

Note 2: Do not expect immediate service. It will take us couple of months to respond to each request.

Silhouettes Computed by the Baseline Algorithm

 

We make available the silhouettes that were computed by the baseline algorithm on the complete dataset, so that one can experiment with similarity computation methods. The complete set of silhouettes is available as a tar file from the “Source Code” page. The silhouettes, which as you will notice, are noisy, but do support the recognition rates reported by the simple correlation based similarity computing baseline strategy.