MindAudio is a toolbox of audio models and algorithms based on MindSpore. It provides a series of APIs for common audio data processing, data augmentation, and feature extraction, so that users can preprocess data conveniently. It also provides examples showing how to build audio deep learning models with MindAudio.
```python
# read audio
>>> import mindaudio.data.io as io
>>> audio_data, sr = io.read(data_file)
# feature extraction
>>> import mindaudio.data.features as features
>>> feats = features.fbanks(audio_data)
```
The released version of MindAudio can be installed via PyPI as follows:

```shell
pip install mindaudio
```
The latest version of MindAudio can be installed from source as follows:

```shell
git clone https://github.com/mindspore-lab/mindaudio.git
cd mindaudio
pip install -r requirements.txt
python setup.py install
```
MindAudio provides a series of commonly used audio data processing APIs, which can be easily invoked for data analysis and feature extraction.
```python
>>> import mindaudio.data.io as io
>>> import mindaudio.data.spectrum as spectrum
>>> import numpy as np
>>> import matplotlib.pyplot as plt
# read audio
>>> audio_data, sr = io.read("./tests/samples/ASR/BAC009S0002W0122.wav")
# feature extraction
>>> n_fft = 512
>>> matrix = spectrum.stft(audio_data, n_fft=n_fft)
>>> magnitude, _ = spectrum.magphase(matrix, 1)
# display
>>> x = [i for i in range(0, 256 * 750, 256)]  # frame start positions in samples (hop length 256)
>>> f = [i / n_fft * sr for i in range(0, int(n_fft / 2 + 1))]  # frequency of each FFT bin in Hz
>>> plt.pcolormesh(x, f, magnitude, shading='gouraud', vmin=0, vmax=np.percentile(magnitude, 98))
>>> plt.title('STFT Magnitude')
>>> plt.ylabel('Frequency [Hz]')
>>> plt.xlabel('Time [samples]')
>>> plt.show()
```
Result presentation:
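For reference, the magnitude spectrogram computed above can be approximated in plain NumPy (a simplified sketch assuming a Hann window and a hop of `n_fft // 2`, which may differ from mindaudio's defaults):

```python
import numpy as np

def stft_magnitude(audio, n_fft=512, hop=256):
    """Naive STFT magnitude: slice into overlapping frames, window, FFT, take |.|."""
    window = np.hanning(n_fft)
    n_frames = 1 + (len(audio) - n_fft) // hop
    frames = np.stack([audio[i * hop : i * hop + n_fft] * window
                       for i in range(n_frames)])
    spec = np.fft.rfft(frames, n=n_fft)  # shape (n_frames, n_fft // 2 + 1)
    return np.abs(spec).T                # (freq_bins, n_frames), like magphase's magnitude

# 1 second of a 440 Hz sine at 16 kHz as a synthetic test signal
sr = 16000
audio = np.sin(2 * np.pi * 440 * np.arange(sr) / sr)
mag = stft_magnitude(audio)
print(mag.shape)  # (257, 61): 257 frequency bins, 61 frames
```

The energy concentrates around bin `440 / sr * n_fft ≈ 14`, which is how the frequency axis `f` in the plotting code maps bin indices to Hz.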
We appreciate all contributions to improve MindSpore Audio. Please refer to CONTRIBUTING.md for the contributing guidelines.
This project is released under the Apache License 2.0.
If you find this project useful in your research, please consider citing:
```bibtex
@misc{MindSporeAudio2022,
  title = {{MindSpore Audio}: MindSpore Audio Toolbox and Benchmark},
  author = {MindSpore Audio Contributors},
  howpublished = {\url{https://github.com/mindspore-lab/mindaudio}},
  year = {2022}
}
```