Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Voice recognition (fixes #58) #59

Draft
wants to merge 18 commits into
base: master
Choose a base branch
from
Draft

Voice recognition (fixes #58) #59

wants to merge 18 commits into from

Conversation

Mikolaj
Copy link
Owner

@Mikolaj Mikolaj commented Jul 5, 2022

Implements #58. An attempt to recognize to which person a voice belongs in a given window of a sound file. Uses RNN.

The tests to run are cabal test extremelyLongTest --enable-optimization -f test_seq --test-options='-p "Speech RNN"''. Example data files not yet included.

@Mikolaj Mikolaj force-pushed the voice-recognition branch from 7ed4e58 to f5ce66a Compare July 5, 2022 22:27
@blackhole64
Copy link

What is the status of this PR?

@Mikolaj
Copy link
Owner Author

Mikolaj commented Feb 17, 2023

I'm afraid, it's very outdated and the new tools it should use instead of the old ones are not ready.

@blackhole64
Copy link

Which tools? Is there a PR/issue for those?

@Mikolaj
Copy link
Owner Author

Mikolaj commented Feb 17, 2023

To be frank, it's not yet clear what those tools should be. Benchmarks are going to show if more work is needed, but benchmarks first need to be ported from the old API and also new ones created. The new API is not finished yet, though. Once it's finished in its current form, ideally we'd add a shaped, not only ranked version, but this may be too hard to do, at least initially. Once we decide and implement that, benchmarking is the next step.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants