Xiaomi has reached a milestone in sound recognition with its self-developed algorithm. Trained on the publicly available AudioSet-2M dataset, Xiaomi’s audio tagging model has exceeded 50 mAP (mean average precision) for the first time, positioning the company as a global leader in the field.
The breakthrough is that Xiaomi’s sound recognition model surpassed the 50 mAP threshold on AudioSet. This result sets a new standard in audio tagging and highlights Xiaomi’s progress in the field.
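For readers unfamiliar with the metric, the sketch below shows how mean average precision is commonly computed for multi-label audio tagging: average precision is calculated per sound class from ranked clip scores, then averaged over classes, and a figure such as "50 mAP" is conventionally that average expressed as a percentage. The clips, scores, and labels are made-up toy data, and the function names are illustrative; nothing here is Xiaomi's evaluation code.

```python
def average_precision(scores, labels):
    """AP for one class: mean of precision@k over the ranks k that hold a positive clip."""
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    hits = 0
    precisions = []
    for rank, (_, label) in enumerate(ranked, start=1):
        if label:
            hits += 1
            precisions.append(hits / rank)
    return sum(precisions) / len(precisions) if precisions else 0.0


def mean_average_precision(score_matrix, label_matrix):
    """Average the per-class AP over every class that has at least one positive clip.

    score_matrix[i][c] is the model score of clip i for class c;
    label_matrix[i][c] is 1 if class c is present in clip i, else 0.
    """
    num_classes = len(score_matrix[0])
    aps = []
    for c in range(num_classes):
        labels = [row[c] for row in label_matrix]
        if any(labels):
            scores = [row[c] for row in score_matrix]
            aps.append(average_precision(scores, labels))
    return sum(aps) / len(aps)


# Toy data (assumption): 4 clips, 2 classes.
toy_scores = [[0.9, 0.2], [0.8, 0.7], [0.3, 0.6], [0.1, 0.4]]
toy_labels = [[1, 0], [0, 1], [1, 1], [0, 0]]

toy_map = mean_average_precision(toy_scores, toy_labels)
print(f"mAP = {100 * toy_map:.1f}")
```

Because each class contributes equally to the average, rare sound classes weigh as much as common ones, which is part of why high mAP on a dataset with hundreds of classes is hard to reach.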
Xiaomi has also introduced a Mini version of the model, designed for resource-constrained scenarios. Despite its smaller size, the Mini model outperforms similar models developed by other organizations.
The practical value of this advance shows in Xiaomi’s smart devices, where it improves the overall user experience. The algorithm recognizes a wide range of environmental sounds, including baby cries, animal noises, and car engines, and can represent them in various forms, such as text.
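As an illustration of how recognized sounds might be rendered as text, the sketch below turns per-class scores from a tagging model into a list of readable labels. The class names, scores, and the 0.5 threshold are all assumptions chosen for the example; the article does not describe how Xiaomi's devices perform this step.

```python
def scores_to_tags(scores, class_names, threshold=0.5):
    """Return the names of classes scoring at or above the threshold, most confident first."""
    detected = [(name, score) for name, score in zip(class_names, scores) if score >= threshold]
    detected.sort(key=lambda pair: pair[1], reverse=True)
    return [name for name, _ in detected]


# Hypothetical class list and model output for a single audio clip.
class_names = ["Baby cry", "Dog bark", "Car engine"]
clip_scores = [0.91, 0.12, 0.64]

tags = scores_to_tags(clip_scores, class_names)
print(", ".join(tags))
```

Audio tagging is multi-label, so a single clip can yield several tags at once, which is why each class is thresholded independently rather than picking only the top score.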
Xiaomi’s robots also benefit from the algorithm. The humanoid robot CyberOne can recognize 85 types of environmental sounds and perceive a wide range of human emotions through auditory sensing. CyberDog 2, the second-generation biomimetic quadruped robot, can identify 38 types of environmental sounds, which significantly improves its dynamic response capabilities.
Xiaomi’s achievement in sound recognition showcases its commitment to innovation and its ability to push boundaries. With this breakthrough, Xiaomi is positioned to advance the field of audio tagging and open up possibilities for improved user experiences across a range of applications.