Introduction
The Egocentric Navigation Robot Gestures (EgoNRG) dataset is an egocentric hand gesture dataset designed to improve Human-Robot Interaction (HRI) in real-world industry, military, and first response applications. It contains 3,000 classified gesture videos and 160,000 images with pixel-based segmentations captured from 32 participants. The participants were recorded performing 11 non-verbal gestures adopted from the Army Field Manual and 1 generic, deictic, pointing gesture referencing abstract objects in indoor and outdoor environments.
Highlights:
- Joint hand and arm segmentations of each participant's left and right limbs.
- Participants performed gestures 1) with long sleeves and gloves (wearing replica flame-resistant solid-color clothing and military camouflage) and 2) with bare skin, to mimic conditions in real-world industrial and military environments.
- Environments with and without background people visible.
- Data captured in both indoor and outdoor environments at various points throughout the day (morning, midday, and dusk).
- Data captured from four synchronized monochrome cameras each with a different perspective.
- Gestures performed map directly to standard ground vehicle robot commands (stop, move forward, go left, move in reverse, etc.).
Content
The dataset contains:
- Videos from 32 participants (14 females / 18 males) performing 12 gestures in total. Participants were split into 4 groups of 8, and each group performed a set of 4 gestures.
- 3,044 videos (~2.5 hours) in total, annotated with gesture type. Each gesture performed by each participant was recorded from four synchronized viewpoints.
- 160,639 annotated frames with "Left Limb" and "Right Limb" pixel-based segmentations. The hands and arms of the participants were segmented together to create a joint segmentation for each respective limb (a minimal mask-loading sketch follows this list).
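As a quick orientation to the segmentation format, the sketch below loads one annotation PNG and separates it into per-limb binary masks. The file path and the class index values (0 = background, 1 = left limb, 2 = right limb) are assumptions for illustration only; check the Dataset Report and the metadata directory for the actual encoding.

import numpy as np
from PIL import Image

# Hypothetical path and class indices -- verify against the Dataset Report.
MASK_PATH = "masks/participant_01/gesture_stop/frame_000123.png"
BACKGROUND, LEFT_LIMB, RIGHT_LIMB = 0, 1, 2

mask = np.array(Image.open(MASK_PATH))   # single-channel class-index image
left = (mask == LEFT_LIMB)               # boolean mask of the left hand+arm
right = (mask == RIGHT_LIMB)             # boolean mask of the right hand+arm

print(f"left limb pixels: {left.sum()}, right limb pixels: {right.sum()}")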
Collection Method
The dataset was collected using the four VLC monochrome cameras attached to the Microsoft HoloLens 2 headset. Each video stream provides an egocentric view of the participant's hands and arms performing a wide variety of gestures from a different perspective. The perspectives include a wide-left, central-left, central-right, and wide-right camera, which together provide detailed visual information about each gesture from multiple viewpoints. The headset streamed the video data to a remote server, where the recorded data was synchronized and saved. Research assistants started and stopped recording on the headset via remote scripts. Three research assistants in total were tasked with collecting the data over two months.
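Because each gesture clip has four synchronized viewpoints, a common access pattern is to step through all four streams frame by frame. The sketch below does this with OpenCV; the file names are hypothetical, and the actual naming convention is described in the Dataset Report.

import cv2

# Hypothetical file names for the four synchronized viewpoints of one gesture clip.
views = ["wide_left.mp4", "central_left.mp4", "central_right.mp4", "wide_right.mp4"]
caps = [cv2.VideoCapture(v) for v in views]

while True:
    frames = []
    for cap in caps:
        ok, frame = cap.read()
        if not ok:
            frames = None
            break
        frames.append(frame)
    if frames is None:
        break  # one of the streams ended
    # frames[0..3] now hold one synchronized frame per viewpoint
    # ... run detection / visualization here ...

for cap in caps:
    cap.release()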
Annotations
The data was manually annotated by nine researchers. Three classes were assigned to each image: left limb, right limb, and background. Human annotators were instructed to annotate each limb as the joint hand and arm in every image where the participant's hand or arm was visible. The annotation pipeline had three steps. First, the human annotators reviewed left limb and right limb bounding boxes that were automatically generated from text prompts with GroundingDINO. Once the bounding boxes for each frame were verified, the images were automatically segmented with Segment Anything 2 (SAM2) and reassembled into videos. These videos were then manually reviewed by the annotators with a tool that played the videos back at 1 FPS and allowed them to step through individual frames. Annotators flagged every frame with incorrect pixel segmentations, then manually reviewed and fixed the segmentations of the flagged frames. Each frame's annotation was converted to a single PNG file encoding the three classes: left limb, right limb, and background.
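The sketch below illustrates the first two automated steps of this pipeline (text-prompted box detection followed by box-prompted segmentation). It is a minimal approximation, not the exact annotation tooling: the text prompt, thresholds, checkpoint and config paths are assumptions, and the GroundingDINO / SAM2 calls shown follow those libraries' published Python APIs, which may differ from the versions used for this dataset.

import torch
from groundingdino.util import box_ops
from groundingdino.util.inference import load_model, load_image, predict
from sam2.build_sam import build_sam2
from sam2.sam2_image_predictor import SAM2ImagePredictor

# Hypothetical checkpoint/config paths and prompt text.
dino = load_model("GroundingDINO_SwinT_OGC.py", "groundingdino_swint_ogc.pth")
image_source, image = load_image("frame_000123.png")

# Step 1: text-prompted bounding boxes (normalized cxcywh) for each limb.
boxes, logits, phrases = predict(
    model=dino, image=image,
    caption="left hand and arm . right hand and arm .",
    box_threshold=0.35, text_threshold=0.25,
)

# Convert the boxes to pixel xyxy coordinates for SAM2.
h, w = image_source.shape[:2]
xyxy = box_ops.box_cxcywh_to_xyxy(boxes) * torch.tensor([w, h, w, h])

# Step 2: box-prompted segmentation masks with SAM2.
predictor = SAM2ImagePredictor(build_sam2("sam2_hiera_l.yaml", "sam2_hiera_large.pt"))
predictor.set_image(image_source)
masks, scores, _ = predictor.predict(box=xyxy.numpy(), multimask_output=False)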
Example of Pixel Segmentation Annotations:
Evaluation
Multiple semantic segmentation and gesture classification models were trained on the dataset. The official model training code and configurations for this dataset are on GitHub. The link to the public GitHub repository is provided in the Software metadata field below.
Human Subjects
This study was approved by the University of Texas at Austin Institutional Review Board (IRB) under IRB ID: STUDY00000278-MOD10. To provide a comprehensive representation of collaborative scenarios, a diverse pool of participants was selected. Any participant who revoked their consent was noted and removed from the data and the annotations.
Dataset Organization
The dataset is organized in the following format. It is recommended users first inspect the metadata under the metadata directory to understand which files should be used for their task. For an in-depth explanation of the dataset file structure, refer to the Dataset Report included in this dataset.
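For example, the per-file metadata can be inspected with pandas before deciding which files to download or load. The file name metadata/gesture_videos.csv and the column names used below are hypothetical; substitute the actual files and fields found in the metadata directory.

import pandas as pd

# Hypothetical metadata file -- check the metadata directory for actual file names.
meta = pd.read_csv("metadata/gesture_videos.csv")
print(meta.columns.tolist())   # discover the available fields
print(meta.head())

# Example filter: all outdoor recordings of one gesture (assumed column names).
subset = meta[(meta["gesture"] == "stop") & (meta["environment"] == "outdoor")]
print(len(subset), "matching clips")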
Dataset Quality Statement
The research team maintained high data quality by adhering to standardized procedures established at the start of dataset collection and throughout the process, ensuring consistency across all participants. All data was ethically sourced using approved protocols that prioritize participant welfare and informed consent. Comprehensive documentation was maintained during data collection to ensure traceability and facilitate auditing. All dataset contents were thoroughly documented in this report and associated repositories, ensuring transparency and reproducibility.
Further Information
More details can be found in the complete dataset report attached and linked below: https://dataverse.tdl.org/api/access/datafile/760102
Download Dataset
1. Install Helper Script Dependencies
- Create and activate a conda environment
conda create -n dataset-dl python==3.8
conda activate dataset-dl
- Install python dependencies
pip install pyDataverse pandas requests
2. Setup TDR API KEY
- Log in to the Texas Data Repository (https://dataverse.tdl.org), click the drop-down menu under your name in the top right corner, and select "API Token"
- Generate and copy the API key.
- In your terminal, create a TDR API key environment variable with the following command
export TDR_API_KEY=<api_key>
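- (Optional) Verify the key is set correctly by querying the Dataverse native API, which echoes back your account information. A small Python sketch using the requests package installed above:

import os
import requests

# Uses the TDR_API_KEY environment variable exported above.
resp = requests.get(
    "https://dataverse.tdl.org/api/users/:me",
    headers={"X-Dataverse-key": os.environ["TDR_API_KEY"]},
)
print(resp.status_code, resp.json())   # 200 and your user info if the key is valid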
3. Download and Run Helper Script
- Create a base directory on your machine
mkdir EgoNRG && cd EgoNRG
- Download the python script from this TDR repo
wget --header="X-Dataverse-key: $TDR_API_KEY" -O "download_dataset.py" "https://dataverse.tdl.org/api/access/datafile/773700"
- Run the script
python3 download_dataset.py [--all | --vids | --imgs | --masks | --anns]
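For example, assuming the flags can be combined as listed, the following would download only the gesture videos and segmentation masks:
python3 download_dataset.py --vids --masks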