UT Egocentric (UT Ego) Dataset [Download (1.4GB)]
The Univ. of Texas at Austin Egocentric (UT Ego) Dataset contains 4 videos captured from head-mounted cameras. Each video is about 3-5 hours long, captured in a natural, uncontrolled setting.
Data
We used the Looxcie wearable camera, which captures video at 15 fps at 320 x 480 resolution. Four subjects wore the camera for us: one undergraduate student, two graduate students, and one office worker. The videos capture a variety of activities such as eating, shopping, attending a lecture, driving, and cooking.
* Due to privacy reasons, we are able to share only 4 of the 10 videos originally captured (one from each subject). They correspond to the test videos that we evaluate on in both the CVPR 2012 and CVPR 2013 papers.
Important region prediction can be evaluated with the provided ground truth. Training and testing should be conducted in a leave-one-out fashion (i.e., train on 3 videos and test on the 1 remaining video). A region whose overlap score (intersection over union) with any ground-truth region is greater than 0.5 should be considered a true positive (i.e., an important object). See "Important region prediction accuracy" in Sec. 4 of CVPR 2012 for guidance on prior studies.
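The protocol above can be sketched in a few lines of Python. This is only an illustration, not the authors' evaluation code: the annotation format is not described here, so regions are assumed to be sets of (row, column) pixel coordinates, and the video names and the `rect` helper are hypothetical.

```python
from typing import Iterable, Iterator, List, Set, Tuple

Pixel = Tuple[int, int]  # assumed region encoding: (row, column) coordinates


def iou(region_a: Set[Pixel], region_b: Set[Pixel]) -> float:
    """Intersection over union of two pixel sets (0.0 if both are empty)."""
    union = len(region_a | region_b)
    if union == 0:
        return 0.0
    return len(region_a & region_b) / union


def is_true_positive(candidate: Set[Pixel],
                     ground_truth_regions: Iterable[Set[Pixel]],
                     threshold: float = 0.5) -> bool:
    """True if the candidate overlaps ANY ground-truth region with IoU
    strictly greater than the threshold."""
    return any(iou(candidate, gt) > threshold for gt in ground_truth_regions)


def leave_one_out(videos: List[str]) -> Iterator[Tuple[List[str], str]]:
    """Yield (training videos, held-out test video) for every video."""
    for i, held_out in enumerate(videos):
        yield videos[:i] + videos[i + 1:], held_out


def rect(top: int, left: int, height: int, width: int) -> Set[Pixel]:
    """Hypothetical helper: a filled rectangle as a pixel set."""
    return {(r, c)
            for r in range(top, top + height)
            for c in range(left, left + width)}


if __name__ == "__main__":
    videos = ["video_1", "video_2", "video_3", "video_4"]  # placeholder names
    for train, test in leave_one_out(videos):
        print("train on", train, "test on", test)

    ground_truth = [rect(0, 0, 4, 4)]
    print(is_true_positive(rect(0, 0, 4, 3), ground_truth))
    print(is_true_positive(rect(0, 2, 4, 4), ground_truth))
```

Note that the comparison is strict: a region with an overlap score of exactly 0.5 is not counted as a true positive, matching the "greater than 0.5" wording.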
Summary quality requires human subject studies for evaluation. See "User studies to evaluate summaries" in Sec. 4 of CVPR 2012 and "Evaluating summary quality" in Sec. 4 of CVPR 2013 for guidance on prior studies.