我如何制作一个将猫叫声翻译成人类语言的应用程序
I believe that to translate a cat’s sound, you need to distinguish if it’s actually a cat’s sound in the first place. Makes sense, doesn’t it? To achieve this, I initially considered designing a sophisticated deep learning algorithm capable of recognizing various sounds. However, I opted for a simpler route and utilized Huawei Technologies’ ML Kit Sound Detector instead. This tool can detect up to 12 different sounds, including cat sounds.
我认为,要翻译猫的声音,首先需要区分这是否真的是猫的声音。这很有道理,不是吗?为此,我最初考虑设计一个复杂的深度学习算法,能够识别各种声音。然而,我选择了一条更简单的路线,使用了华为技术的ML Kit声音检测器。这个工具可以检测多达12种不同的声音,包括猫声。
Alright, we’ve managed to distinguish between a meow and a fart sound — progress! Now, I needed a collection of cat sound examples that represent specific cat behaviors. To achieve this, I followed the work of Yagya Raj Pandeya and Joonwhoan Lee on Domestic Cat Sound Classification Using Transfer Learning. Fortunately, cats steer clear of political discussions. When they vocalize, it’s usually about particular situations or behaviors:
好吧,我们已经能够区分喵声和放屁声——进步!现在,我需要一组代表特定猫行为的猫声示例。为此,我参考了Yagya Raj Pandeya和Joonwhoan Lee关于使用迁移学习进行家猫声音分类的研究。幸运的是,猫咪避免政治讨论。当它们发声时,通常是关于特定情况或行为:
- Hunger
- 饥饿
- Eagerness to play/hunt/go out
- 渴望玩耍/狩猎/外出
- Annoyance/danger
- 烦恼/危险
- Sleepiness
- 困倦
- Contentment/happiness
- 满足/快乐
- Amorous feelings
- 爱意
- Anger
- 愤怒
Having categorized these behaviors, it’s time to gather some examples. Luckily, many people record their cats while they meow. The downside? My YouTube history is now filled with searches like:
在对这些行为进行分类后,是时候收集一些示例了。幸运的是,许多人在猫咪叫的时候录制它们。缺点?我的YouTube历史现在充满了这样的搜索:
“Horny cat sounds- 1 Hour HIGH QUALITY”
“发情猫声音- 1小时高质量”
For those wondering, this is what a horny cat sound graph looks like
对于那些好奇的人,这就是发情猫声音图的样子
Once I gathered all the raw example data in MP3 format, it was time to compare these sounds to those made by Gilbert. It’s important to note that this app is tailored specifically for Gilbert, so all the data used pertains to one-year-old male cats. I’m not entirely sure what you’ll do with this information, but there it is. Now, I needed to co...