Podcast highlight extraction

Installation

The repo was developed on MacBook M1 chip laptop. Thus, Python 3.8 and most packages, as well as the virtual environment, are managed using Conda. The list of packages is shown in the file requirements.txt.

Problem

sentencepiece might not be able to be import on Apple M1

arch -arm64 brew install cmake pip install --no-cache-dir sentencepiece

Data

Sound data are stored under the directory ./sound

Step 1: Speaker segmentation (find_speaker_segment.py)

Using SpeechBrain's speaker embeddings from the HuggingFace repo of "spkrec-ecapa-voxceleb". See https://huggingface.co/speechbrain/spkrec-ecapa-voxceleb

Step 2: Audio tagging by AudioSet (assign_audioset_labels.py)

The sound oncology is shown in the following link: https://research.google.com/audioset/ontology/index.html
Vggish and YAMNet Download those two models for AudioSet from a directory within TensorFlow repo (Trick: DownGit) https://github.com/tensorflow/models/tree/master/research/audioset

Step 3: Find highlight through the combination of sentence embedding (BERT embeddings) and music presence

Huggingface's sentence transformer https://huggingface.co/sentence-transformers/sentence-t5-large
Final score = Alpha * Music Score (0 or 1) + (1 - Alpha) * Sentence similarity

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
audioset		audioset
sound		sound
README.md		README.md
lib_utility.py		lib_utility.py
one_find_speaker_segment.ipynb		one_find_speaker_segment.ipynb
requirements.txt		requirements.txt
three_find_highlight-v1.ipynb		three_find_highlight-v1.ipynb
three_find_highlight.ipynb		three_find_highlight.ipynb
two_assign_audioset_labels.ipynb		two_assign_audioset_labels.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Podcast highlight extraction

Installation

Problem

Data

Step 1: Speaker segmentation (find_speaker_segment.py)

Step 2: Audio tagging by AudioSet (assign_audioset_labels.py)

Step 3: Find highlight through the combination of sentence embedding (BERT embeddings) and music presence

About

Releases

Packages

Languages

atoultaro/podcast_highlight

Folders and files

Latest commit

History

Repository files navigation

Podcast highlight extraction

Installation

Problem

Data

Step 1: Speaker segmentation (find_speaker_segment.py)

Step 2: Audio tagging by AudioSet (assign_audioset_labels.py)

Step 3: Find highlight through the combination of sentence embedding (BERT embeddings) and music presence

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages