Biodiversity evaluation and monitoring is the first step toward its protection. The goal of the Microfaune project is to evaluate avifauna in Cité Universitaire park (Paris, France) from audio recordings.
The aim is to provide the scientific community with a labeled database (bird/no-bird) and to develop machine learning algorithms for bird audio detection. The project includes also the creation (from scratch or using existing tools) of web-based tools to label and visualize bird sounds.
The goal is to advance leverage state of the art research and contribute to open data in the field.
Here is a tentative roadmap (to be challenged!):
Since we'll work on audio data, it should be useful to have notions on basic operations on audio. We provide an introductory notebook showing how to load audio data, listen to it in the notebook, plot waveforms, calculate spectrograms, etc.
Here is a notebook to get started! Feel free to contribute and improve this notebook!
Using pipenv to create virtual environment. You need to have pipenv
installed:
On mac:
brew install pipenv
Other plateforms: see the doc.
Then, you can fork or clone this repo and run:
pipenv install
And you should have all dependencies installed! (Locking can take some time so be patient!).
To install a lib you need, e.g:
pipenv install numpy
To run Python from the virtual environment:
pipenv run ...
For instance, to run a Jupyter notebook:
pipenv run jupyter notebook
If you look on the roadmap, you should see some sub-projects using external data. These data are audio files labelised with the presence or absence of bird. There are at least two databases:
- Warblr
- FreeField1010
Some ideas can be tried on these data making easy comparison of performances.
You can find these databases here.
Wazo Data are data from recordings in Cité Universitaire in Paris. Data should be accessible soon.
Here is the data from the scientific survey from 2014: