Learnable Window #108

gunikavashisht13472 · 2021-11-03T15:05:32Z

Could you please elaborate why you have not used Learnable_window in STFT , Mel Spectrograms and MFCC but used in their inverse counterparts?

KinWaiCheuk · 2021-11-04T07:51:54Z

The learnable kernels are, by default, disable in all STFT, Mel spectrograms, MFCC, and their inverse counterparts.

If you are referring to the argument refresh_win as shown below, it is not for learnable kernels. It is for recalculating the window_sumsquare for different audio lengths, which is essential to obtained a correct inverse. If all of your audio clips are of the same length, you can make refresh_win=False to speed up the calculation a little bit.

nnAudio/Installation/nnAudio/Spectrogram.py

Line 47 in e3ad18d

    
           def inverse_stft(self, X, kernel_cos, kernel_sin, onesided=True, length=None, refresh_win=True):

gunikavashisht13472 · 2021-11-04T08:53:23Z

Thank you for the answer. I want to use RNNS instead of CNNs in my maodel. Will this code work for RNNs too?

KinWaiCheuk · 2021-11-05T05:52:04Z

Thank you for the answer. I want to use RNNS instead of CNNs in my maodel. Will this code work for RNNs too?

Yes it will work. nnAudio is just for spectrogram extraction using GPU. Once you have that spectrogram, you can use any model of your choice. The CNN in the example is just a demonstration on how to use nnAudio together with a pytorch model.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Learnable Window #108

Learnable Window #108

gunikavashisht13472 commented Nov 3, 2021

KinWaiCheuk commented Nov 4, 2021

gunikavashisht13472 commented Nov 4, 2021

KinWaiCheuk commented Nov 5, 2021

Learnable Window #108

Learnable Window #108

Comments

gunikavashisht13472 commented Nov 3, 2021

KinWaiCheuk commented Nov 4, 2021

gunikavashisht13472 commented Nov 4, 2021

KinWaiCheuk commented Nov 5, 2021