2024. 5. 18. 13:14ㆍAudio Signal Processing for ML
Time-domain features
First, an ADC (analog-to-digital conversion) step converts the analog signal into a digital one through sampling and quantization.
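As a minimal sketch of the two halves of ADC (using NumPy; the signal, sample rate, and bit depth here are illustrative, not prescribed by the post), sampling evaluates the signal at discrete time steps and quantization rounds each sample to a finite set of levels:

```python
import numpy as np

def quantize(signal, num_bits=16):
    """Uniform quantizer: round each sample in [-1, 1] to num_bits resolution."""
    levels = 2 ** (num_bits - 1)
    return np.round(signal * levels) / levels

# "Sampling": evaluate a 5 Hz analog sine at 100 discrete points per second.
sr = 100
t = np.arange(sr) / sr
x = np.sin(2 * np.pi * 5 * t)

x_q = quantize(x, num_bits=8)  # quantization error is at most 1 / 2**8
```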
Next comes a framing procedure, which groups samples into perceivable audio chunks (note that the ear's time resolution is about 10 ms).
Note that consecutive frames overlap with each other; the reason will become clear later.
After framing, feature computation and its aggregation are performed.
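A rough sketch of framing plus per-frame feature computation (NumPy; the frame and hop sizes are illustrative, and RMS energy stands in for whatever feature is being extracted):

```python
import numpy as np

def frame_signal(signal, frame_size, hop_size):
    """Split a 1-D signal into overlapping frames (hop_size < frame_size)."""
    num_frames = 1 + (len(signal) - frame_size) // hop_size
    return np.stack([signal[i * hop_size : i * hop_size + frame_size]
                     for i in range(num_frames)])

def rms_per_frame(frames):
    """Compute the root-mean-square energy of each frame."""
    return np.sqrt(np.mean(frames ** 2, axis=1))

# Example: 1 second of a 440 Hz sine at a 22050 Hz sampling rate.
sr = 22050
t = np.arange(sr) / sr
x = np.sin(2 * np.pi * 440 * t)

frames = frame_signal(x, frame_size=1024, hop_size=512)  # 50% overlap
rms = rms_per_frame(frames)  # one feature value per frame
```

Aggregation would then collapse the per-frame values, e.g. `rms.mean()`, into a single descriptor for the whole clip.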
Frequency-domain features
The main pipeline is similar to that of time-domain features.
The difference is that there is a windowing process.
When we perform the short-time Fourier transform (STFT), a problem called "spectral leakage" occurs.
Spectral leakage occurs when the signal inside a frame is not periodic, so discontinuities appear at the frame boundaries.
When we compute the STFT of such a signal, the resulting spectrum contains relatively high-frequency components that are not present in the original signal.
To mitigate this problem, we use something called "windowing": a window function is multiplied with each frame so that its edges taper smoothly toward zero. The most common window function is the Hann window.
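A small numerical illustration of this (NumPy; the tone frequency, sample rate, and frame size are arbitrary choices for the demo): a sine that does not complete a whole number of cycles in the frame leaks energy far from its true frequency, and multiplying the frame by a Hann window suppresses that leaked energy:

```python
import numpy as np

# A frame whose sine does not complete an integer number of cycles: its
# implicit periodic extension is discontinuous, which causes leakage.
n = 1024
sr = 8000
freq = 443.0  # deliberately not aligned with the FFT bin grid
frame = np.sin(2 * np.pi * freq * np.arange(n) / sr)

hann = 0.5 * (1 - np.cos(2 * np.pi * np.arange(n) / n))  # periodic Hann window
spec_raw = np.abs(np.fft.rfft(frame))
spec_win = np.abs(np.fft.rfft(frame * hann))

# Energy in the top half of the spectrum, far from the 443 Hz tone, is
# pure leakage; windowing reduces it by orders of magnitude.
leak_raw = spec_raw[n // 4:].max()
leak_win = spec_win[n // 4:].max()
```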
But this solution causes another problem: samples near the frame edges are attenuated by the window, losing information from the original signal. This is why we overlap the frames, so that every sample is preserved at (near) full weight in some frame.
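The combination of a periodic Hann window with a 50% hop is a common choice precisely because the shifted windows sum to a constant, so after overlapping, no sample is systematically attenuated. A quick check of this property (NumPy; the sizes are illustrative):

```python
import numpy as np

frame_size = 1024
hop = frame_size // 2  # 50% overlap
hann = 0.5 * (1 - np.cos(2 * np.pi * np.arange(frame_size) / frame_size))

# Sum shifted copies of the window over a stretch of signal.
length = frame_size * 8
coverage = np.zeros(length)
for start in range(0, length - frame_size + 1, hop):
    coverage[start:start + frame_size] += hann

# Away from the edges, the overlapping Hann windows sum to exactly 1.0,
# so every sample receives the same total weight.
interior = coverage[frame_size:-frame_size]
```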