Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Pose datasets start/end #8

Closed
AmitMY opened this issue Feb 14, 2025 · 4 comments
Closed

Pose datasets start/end #8

AmitMY opened this issue Feb 14, 2025 · 4 comments
Labels
request A modification or the implementation of a new feature is requested.

Comments

@AmitMY
Copy link
Contributor

AmitMY commented Feb 14, 2025

It is most common to get data in subtitles format, or other time aligned formats

Therefore, when something like this:

happen, it forces the user to do unnecessary calculation. (also, for videos for example)

Instead, please use start_time and end_time which are in milliseconds.

And either way, they are not floats, but ints

"source_start": Optional[float],
"source_end": Optional[float],

@GerrySant
Copy link
Owner

This is something I have thought about a lot. The problem is that to make the conversion to frames (useful for some modalities) I need to know the fps of the input file. I would like to find a way to extract the fps directly from the file (be it mp4 or whatever).

I am aware that for poses, pose-format allows to specify start and end in milliseconds. The problem is currently in video (which is not fully implemented at the moment, but will extend image2text).

I know it's something to look at, but I'm focusing on more priority things at the moment. Any suggestions are welcome.

@AmitMY
Copy link
Contributor Author

AmitMY commented Feb 14, 2025

for poses - specifying start and end time uses the body.fps to calculate
https://github.com/sign-language-processing/pose/blob/e07ca689547e228952ca4dbad17dab591df33bc9/src/python/pose_format/pose_body.py#L239

for videos - depending on the use case - can perform the same thing by opening the video and asking for the fps - thing is, videos don't necessarily have constant fps, and then, it is way more important to work with times.


I feel like working with frames burdens the user with one more processing step (since most data is time aligned) while time is natural

@GerrySant
Copy link
Owner

Noted, upcoming commits will change default unit to seconds.

@AmitMY
Copy link
Contributor Author

AmitMY commented Feb 14, 2025

milliseconds is better*
they are integers, and easier to work with in general

@GerrySant GerrySant added the request A modification or the implementation of a new feature is requested. label Feb 15, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
request A modification or the implementation of a new feature is requested.
Projects
None yet
Development

No branches or pull requests

2 participants