in A.I.

Is Facebook working on a new AI model that predicts future actions?

Facebook has introduced Anticipative Video Transformer (AVT), a new machine-learning method that uses visual interpretation to forecast future behaviors.

by IOI November 21, 2021, 4:33 pm 7.7k Views 0 Comments

Facebook has introduced Anticipative Video Transformer (AVT), a new machine-learning method that uses visual interpretation to forecast future behaviours. AVT is an attention-based approach for action anticipation in videos that operates from beginning to conclusion.

The new model is built on recent developments in transformer topologies, notably for natural language processing and image modelling for self-driving vehicles and augmented reality applications.

AVT examines an action to determine the likely outcome, with a focus on AR and the metaverse. Through APIs that allow programmes to communicate with one another, Facebook intends for its metaverse apps to function across platforms and devices.

Future activity prediction is a tough problem for AI since it requires both forecasting the multimodal distribution of future activities and modelling the trajectory of existing actions.

Because AVT is attention-based, it may analyse an entire sequence in parallel, whereas recurrent-neural-network-based techniques must process sequences sequentially and sometimes forget the past. Loss functions in AVT allow the model to capture the sequential character of video, which would otherwise be lost in attention-based designs like nonlocal networks.

AVT is made up of two parts: an attention-based backbone (AVT-b) that works with video frames and an attention-based head architecture (AVT-h) that works with the backbone’s features.

The vision transformer (VIT) architecture is used to build the AVT-b backbone. It divides frames into non-overlapping patches, uses a feedforward network to embed them, adds a particular categorization token, and applies many levels of multihead self-attention. The head architecture takes the per-frame characteristics and applies a causal attention transformer architecture. This implies it only considers characteristics from the current and previous frames. As a result, the model can generate a representation of every specific frame only based on previous characteristics.

AVT might be utilised as an augmented reality action coach or as an AI assistant that warns individuals before they make mistakes. AVT might also be useful for tasks other than anticipation, such as self-supervised learning, discovering action schemas and bounds, and even general action recognition in tasks that involve modelling the temporal sequence of activities.

Written by IOI

Get the latest stories from Tech & Innovation from around the globe. Subscribe Now!

Is Facebook working on a new AI model that predicts future actions?

Facebook has introduced Anticipative Video Transformer (AVT), a new machine-learning method that uses visual interpretation to forecast future behaviors.

Written by IOI

As the cryptocurrency market approaches a tipping point, eToro has launched a $20 million NFT fund.

After several delays, NASA modifies critical testing of the mega moon rocket!

Ajay Devgn’s Runway 34 is the first exclusive NFT drop on Hefty Marketplace!

Pick your AI stocks: Alphabet vs Apple!

Two new exoplanets of Saturn’s mass have been discovered!

US Space Command confirms that a meteorite that hit Earth in 2014 was alien.

Reinvesting $140 Billion in AI: A Second Attempt at Success

How Captcha’s are evolving in the AI Realm!

How Celebrity AI and Contractual AI Clones with Celebrities can change Hollywood!

AI Imagines Rock Dwayne Johnson in a Life Montage!

The AI Paradox: Fostering Creativity or Forging Competition?

AI and its possible Dangers or may be not as much as they tell you? IOI Reports

Leave a Reply Cancel reply

Did you know that increased blood sugar can show up on your skin? Diagnose diabetes this way.

What If Humanity Was a Type VII Civilization?

Hike’s blockchain gaming platform now includes NFT avatars!

Telugu actress Lakshmi Manchu joins the WazirX NFT Marketplace!

For 1.3 million dollars, Justin Bieber bought a Bored Ape Yacht Club NFT!

How Captcha’s are evolving in the AI Realm!

Unveiling the Hidden World of Online Earnings: Top 8 Secret Websites to Make Money in 2023

Elevating Horror Gaming: Unreal Engine 5.3 Unleashes Next-Level Realism and Immersion

Unlocking the Secrets of Human Longevity: How Virtual Cells Are Changing the Game

Reinvesting $140 Billion in AI: A Second Attempt at Success

The AI Paradox: Fostering Creativity or Forging Competition?

AI and its possible Dangers or may be not as much as they tell you? IOI Reports

Top 10 Blockchain Project and development programming languages!

Using AI and machine learning to solve the challenge of globalization in the entertainment industry!

Unveiling the Hidden World of Online Earnings: Top 8 Secret Websites to Make Money in 2023

Leave a Reply Cancel reply

ARE YOUA STARTUP?

ARE YOU
A STARTUP?