GTCRN - アペフチ

GTCRN

gtcrn_simple.onnx

code:ruby

session = OnnxRuntime::InferenceSession.new("gtcrn_simple.onnx")

pp session

@allocator=#<FFI::Pointer address=0x000000012785f990>,

@inputs=

[{name: "mix", type: "tensor(float)", shape: 1, 257, 1, 2},

{name: "conv_cache", type: "tensor(float)", shape: 2, 1, 16, 16, 33},

{name: "tra_cache", type: "tensor(float)", shape: 2, 3, 1, 1, 16},

{name: "inter_cache", type: "tensor(float)", shape: 2, 1, 33, 16}],

@outputs=

[{name: "enh", type: "tensor(float)", shape: 1, 257, 1, 2},

{name: "conv_cache_out", type: "tensor(float)", shape: 2, 1, 16, 16, 33},

{name: "tra_cache_out", type: "tensor(float)", shape: 2, 3, 1, 1, 16},

{name: "inter_cache_out", type: "tensor(float)", shape: 2, 1, 33, 16}],

@session=#<FFI::AutoPointer address=0x0000000bab62aa00>>

mixはWAVEファイルから読み込んだただのサンプルの配列でいいっぽい。

Melスペクトログラムとかにはしなくていい

ビット深度は調整する必要はあるかも

を見ると、conv_cacheとかは用意すると言っても単に0で埋めればいいっぽい。

を見るとフーリエ変換しているな……