The non-AR Ray-Ban Stories glasses launched by Meta last year. Photo: EssilorLuxottica.

Meta’s Latest in AR Audio Research; Spotify Debuts a Roblox World

By  |  May 3, 2022 11:01 AM PDT
Photo: The non-AR Ray-Ban Stories glasses launched by Meta last year. Photo: EssilorLuxottica.

Every now and then, it’s worth diving into the copious amounts of AR/VR research that Meta Platforms publishes online. Not all of it has clear ties to the company’s product ambitions, but a recently published paper reveals how Reality Labs researchers have progressed toward giving AR glasses users what Chief Scientist Michael Abrash has pitched as “audio superpowers.”

In particular, he’s talking about ways AR glasses could eventually deliver  noise-canceling and hearing assistance features. In order to make those kinds of “superpower” abilities work in a variety of situations, the glasses will need to be able to pinpoint the positions of people who are speaking. That’s not as simple as it might seem. Imagine you’re in a crowded room: figuring out who’s talking isn't always straightforward. Meta’s new paper, submitted to next month’s Conference on Computer Vision and Pattern Recognition, details a new approach to that problem, known as “active speaker localization.” 

Access on the go
View stories on our mobile app and tune into our weekly podcast.
Join live video Q&A’s
Deep-dive into topics like startups and autonomous vehicles with our top reporters and other executives.
Enjoy a clutter-free experience
Read without any banner ads.
From left, a Google TPU, Broadcom CEO Hock Tan and Google Cloud chief Thomas Kurian. Photos via Getty, Google and YouTube.
Exclusive google semiconductors
To Reduce AI Costs, Google Wants to Ditch Broadcom as Its TPU Server Chip Supplier
Google executives have extensively discussed dropping Broadcom as a supplier of artificial intelligence chips as early as 2027, according to a person with direct knowledge of the effort.
Photo via Midjourney.
AI Agenda startups ai
The Rise of Startups That Help Other Startups Evaluate LLMs
All but a handful of artificial intelligence startups typically fall into one of two camps. The first group uses a single large-language model, typically OpenAI’s GPT-4, to power their applications.
Photos via Eiso Kant (left) and YouTube/VMWare Tanzu (right)
AI Agenda startups ai
How GitHub Copilot’s Co-Creator Raised $126 Million to Compete with His Former Employer
Recent interest in artificial intelligence has focused on large-language models that aim to do everything from writing Shakespearean poetry to solving math riddles.
Art by Clark Miller
Exclusive startups entertainment
MasterClass Takes a Crash Course in Frugality
MasterClass had a problem with the shoot featuring its latest star instructor, Walt Disney Co. CEO Bob Iger.
If AI researchers can meet Nat Friedman's Vesuvius Challenge, “It’ll be the first time we’ve read handwriting that hasn’t been seen in 2,000 years.” Art by Clark Miller
The AI Age culture ai
Nat Versus the Volcano: Can an AI Investor Solve an Ancient Mystery from the Ashes of Vesuvius?
Long before men’s daily thoughts about ancient Rome became a TikTok meme , former GitHub CEO Nat Friedman’s mind was regularly turning toward the Roman Empire.
Photo via Jacopo Pantaleoni.
AI Agenda ai
Nvidia Engineer’s Message to Google AI Researchers: Leave Your Company
Jacopo Pantaleoni joined Nvidia in 2001 when the company had less than 500 employees. He worked on what was then a small research project to improve Nvidia’s graphics processing units so they could better render images on computers and gaming consoles.