Free plan available

Meta Segment Anything Model 2

Unified AI model for real-time object segmentation in images and videos.

Visitai.meta.com
Intro

What is Meta Segment Anything Model 2?

Meta Segment Anything Model 2 (SAM 2) is a cutting-edge, open-source unified model designed by Meta AI for fast and precise object segmentation across both images and videos. Building upon the foundational breakthroughs of the original SAM architecture, SAM 2 allows users to select any object using a click, box, or mask input. While tech enthusiasts often look for future iterations like sam3 or sam 3d (sam3d) capabilities, SAM 2 currently sets the state-of-the-art benchmark by introducing a per-session memory module that tracks objects seamlessly across video frames, even during occlusions. This open innovation release by Meta IA provides developers working in ecosystems like AI Studio with a powerful tool for building real-time interactive applications, precise video editing, and modern generative AI workflows like tribe v2 or meta muse spark (muse spark).

Meta Segment Anything Model 2 at a glance
Free1.7M monthly visitsHas free access
Pricing

Meta Segment Anything Model 2 Pricing Plans

Compare Meta Segment Anything Model 2 free options, Meta Segment Anything Model 2 paid pricing plans, and usage notes before you choose the best way to use this AI tool in 2026.

Free

Pricing updated:Jun 11, 2026

Features

Meta Segment Anything Model 2 AI Features

Unified segmentation for both static images and fluid video framesInteractive prompting via click, bounding box, or mask selectionPer-session memory module for robust tracking across video frames, even during temporary occlusionsZero-shot performance enabling immediate application on unfamiliar videos and unseen objectsStreaming architecture designed for real-time interactivity and fast inference speedOpen-source release under an Apache 2.0 license with access to the SA-V dataset
Pros & Cons

Meta Segment Anything Model 2 Pros and Cons

Pros

  • Outperforms existing video object segmentation models, particularly for tracking parts
  • Requires significantly less interaction time than traditional interactive video segmentation methods
  • Geographically diverse training data (SA-V dataset) collected across 47 countries ensures strong real-world representation
  • Extensible outputs that integrate smoothly with modern video generation models for precise editing

Limitations

  • Requires high-performance hardware for independent local deployment and streaming inference
  • May require manual correction prompts on highly complex or completely obscured video sequences

Meta Segment Anything Model 2 FAQ

SAM 2 is a unified model specializing in 2D image and video object tracking. However, because its outputs are highly extensible, developers frequently integrate it with other Meta IA systems, AI Studio environments, and experimental pipelines like muse spark (meta muse spark) or tribe v2 to generate masks that assist in spatial and 3D reconstruction tasks.