CameraX video capturing architecture

A capturing system generally records video and audio streams, compresses them, combines (or muxes) the two streams into one stream, and then writes the resultant stream to disk.

Figure 1. Conceptual diagram for a video and audio capturing system.

In CameraX, the solution for video capturing is the VideoCapture use case:

Figure 2. Conceptual diagram that shows how CameraX handles the VideoCapture use case.

As shown in figure 2, CameraX video capture includes a few high-level architectural components:

  • SurfaceProvider for the video source.
  • AudioSource for the audio source.
  • Two encoders to encode and compress video/audio.
  • A media muxer to mux the two streams.
  • A file saver to write out the result.

The VideoCapture API abstracts the complex capturing engine and provides applications with a much simpler, more straightforward API.
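To make the muxing step of the architecture more concrete, the following is a minimal sketch in plain Kotlin. It is not how CameraX or any real media muxer is implemented: `Sample` and `mux` are hypothetical names, and the sketch only interleaves two timestamp-sorted streams, leaving out encoding and container metadata entirely.

```kotlin
// Hypothetical types for illustration only; none of these are CameraX classes.
data class Sample(val track: String, val timestampUs: Long)

/**
 * Interleaves two streams, each already sorted by timestamp, into a single
 * stream sorted by timestamp. On equal timestamps the video sample goes first
 * (an arbitrary choice made for this sketch).
 */
fun mux(video: List<Sample>, audio: List<Sample>): List<Sample> {
    val out = ArrayList<Sample>(video.size + audio.size)
    var v = 0
    var a = 0
    while (v < video.size && a < audio.size) {
        if (video[v].timestampUs <= audio[a].timestampUs) {
            out.add(video[v++])
        } else {
            out.add(audio[a++])
        }
    }
    // Drain whichever stream still has samples left.
    while (v < video.size) out.add(video[v++])
    while (a < audio.size) out.add(audio[a++])
    return out
}

fun main() {
    val video = listOf(Sample("video", 0), Sample("video", 33_000))
    val audio = listOf(Sample("audio", 10_000), Sample("audio", 33_000))
    mux(video, audio).forEach { println("${it.timestampUs} ${it.track}") }
}
```

With VideoCapture, an application never performs this step itself; the sketch only shows the kind of work that the API hides.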

VideoCapture API overview

VideoCapture is a CameraX use case that works well on its own or when combined with other use cases. Specific supported combinations depend on the camera hardware capabilities, though the combination of Preview and VideoCapture is valid on all devices.

The VideoCapture API consists of the following objects that communicate with applications:

  • VideoCapture is the top-level use case class. VideoCapture binds to a LifecycleOwner with a CameraSelector and other CameraX UseCases. For more information about these concepts and usages, see CameraX Architecture.
  • A Recorder is an implementation of VideoOutput that is tightly coupled with VideoCapture. Recorder is used to perform the video and audio capturing. An application creates recordings from a Recorder.
  • A PendingRecording configures a recording, providing options like enabling audio and setting an event listener. You must use a Recorder to create a PendingRecording. A PendingRecording does not record anything.
  • An ActiveRecording performs the actual recording. You must use a PendingRecording to create an ActiveRecording.

Figure 3 shows the relationships between these objects:

Figure 3. Diagram showing the interactions that occur in a VideoCapture use case.


  1. Create a Recorder with a QualitySelector.
  2. Configure the Recorder with one of the OutputOptions.
  3. Enable audio with withAudioEnabled() if needed, and register a VideoRecordEvent listener with withEventListener().
  4. Call start() to begin recording.
  5. Use pause()/resume()/stop() on the ActiveRecording to control the recording.
  6. Respond to VideoRecordEvents inside your event listener.

The detailed API list is in the current.txt inside the source code.

Using the VideoCapture API

To integrate the CameraX VideoCapture use case into your app, do the following:

  1. Bind VideoCapture.
  2. Prepare and configure recording.
  3. Start and control the runtime recording.

The following sections outline what you can do at each step to get an end-to-end recording session.

Bind VideoCapture

To bind the VideoCapture use case, do the following:

  1. Create a Recorder object.
  2. Create a VideoCapture object.
  3. Bind to a Lifecycle.

The CameraX VideoCapture API follows the builder design pattern. Applications use Recorder.Builder to create a Recorder. You can also configure the video resolution for the Recorder through a QualitySelector object.

CameraX Recorder supports the following resolutions:

  • QualitySelector.QUALITY_UHD for 4K ultra HD video size (2160p)
  • QualitySelector.QUALITY_FHD for full HD video size (1080p)
  • QualitySelector.QUALITY_HD for HD video size (720p)
  • QualitySelector.QUALITY_SD for SD video size (480p)

Note that CameraX can also choose other resolutions when authorized by the app.

The exact video size of each selection depends on the camera and encoder's capabilities. For more information, see the documentation for CamcorderProfile.

Applications can configure resolution by creating a QualitySelector. You can create a QualitySelector using one of the following methods:

  • Try a few preferred resolutions by using firstTry() for the most preferred resolution and thenTry() for other preferences. Be sure to include a fallback strategy for when none of the preferred resolutions are supported.

    CameraX can decide the best fallback match based on the selected camera's capability. For example, the following code requests the maximum supported resolution for recording:

    val qualitySelector = QualitySelector
  • Query the camera capabilities first, and choose from the supported resolutions using QualitySelector::of():

    // Pick the first back-facing camera
    val cameraInfo = cameraProvider.availableCameraInfos.filter {
        Camera2CameraInfo.from(it)
            .getCameraCharacteristic(CameraCharacteristics.LENS_FACING) ==
                CameraMetadata.LENS_FACING_BACK
    }.first()
    val supportedQualities = QualitySelector.getSupportedQualities(cameraInfo)
    // Keep only the supported qualities that we want to
    // allow users to select
    val filteredQualities = listOf(
        QualitySelector.QUALITY_UHD,
        QualitySelector.QUALITY_FHD,
        QualitySelector.QUALITY_HD,
        QualitySelector.QUALITY_SD
    ).filter { supportedQualities.contains(it) }
    val qualityListViewAdapter =
        QualityListViewAdapter(filteredQualities) { view: View ->
            // Inside View.OnClickListener,
            // convert QualitySelector.QUALITY_* constant to QualitySelector
            val qualitySelector = QualitySelector.of(view.tag as Int)
            // Create a new Recorder/VideoCapture for the new quality
            // and bind to lifecycle
            val recorder = Recorder.Builder()
                .setQualitySelector(qualitySelector)
                .build()
            // ...
        }
    // Set the view adapter for the quality selection list
    qualitySelectionView.adapter = qualityListViewAdapter

    Note that the returned capability from QualitySelector.getSupportedQualities() is guaranteed to work for either the VideoCapture use case or the combination of VideoCapture and Preview use cases. When binding with ImageCapture or ImageAnalysis, CameraX might still fail to bind when the required combination is not supported on the requested camera.
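The idea of trying preferred resolutions in order and then falling back can be illustrated without any CameraX dependency. The following is a minimal sketch in plain Kotlin, not the CameraX implementation: `Quality`, `selectQuality`, and the "highest supported quality below the lowest preference" fallback rule are all assumptions chosen for illustration.

```kotlin
// Hypothetical model of quality selection; these types are NOT part of CameraX.
enum class Quality(val height: Int) { SD(480), HD(720), FHD(1080), UHD(2160) }

/**
 * Returns the first preferred quality that the camera supports. If none is
 * supported, falls back to the highest supported quality below the lowest
 * preference (an assumed strategy), or null if nothing qualifies.
 */
fun selectQuality(preferred: List<Quality>, supported: Set<Quality>): Quality? {
    // Preferences are tried in the order given.
    preferred.firstOrNull { it in supported }?.let { return it }
    // Fallback: look below the lowest preferred quality.
    val floor = preferred.minByOrNull { it.height } ?: return null
    return supported.filter { it.height < floor.height }.maxByOrNull { it.height }
}

fun main() {
    val preferred = listOf(Quality.UHD, Quality.FHD)
    // A preferred quality is supported, so it is chosen directly.
    println(selectQuality(preferred, setOf(Quality.FHD, Quality.SD)))
    // No preferred quality is supported, so the fallback rule applies.
    println(selectQuality(preferred, setOf(Quality.HD, Quality.SD)))
}
```

Returning null when nothing matches forces the caller to handle the unsupported case explicitly, which mirrors the advice above to always include a fallback strategy.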

Once you have a QualitySelector, the application can create a VideoCapture object and perform the binding. Note that this binding is the same as with other use cases:

val recorder = Recorder.Builder()
    .setQualitySelector(qualitySelector)
    .build()
val videoCapture = VideoCapture.withOutput(recorder)

try {
    // Bind use cases to camera
    cameraProvider.bindToLifecycle(
            this, CameraSelector.DEFAULT_BACK_CAMERA, preview, videoCapture)
} catch(exc: Exception) {
    Log.e(TAG, "Use case binding failed", exc)
}

The Recorder selects the most suitable format for the system. The most common video codec is H.264 (AVC) with the MPEG-4 container format.

Configure and create recording

After binding the VideoCapture use case, the remaining capturing configuration is done using the Recorder and the recording objects that the Recorder creates.

From a Recorder, the application can create recording objects that then perform the video and audio capturing. Applications create recordings by doing the following:

  1. Configure OutputOptions with prepareRecording().
  2. (Optional) Enable audio recording, or register a VideoRecordEvent listener.
  3. Use start() to begin video capturing.

Once you begin capturing video, the Recorder returns an ActiveRecording object. Your application can use this ActiveRecording object to finish capturing or to perform other actions, such as pausing or resuming.

A Recorder supports one ActiveRecording object at a time. You can start a new recording once you've called ActiveRecording.stop() or ActiveRecording.close() on the previous ActiveRecording object.
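The relationship between the three objects, and the rule that only one recording can be active at a time, can be modeled in a few lines of plain Kotlin. This is a sketch under stated assumptions, not the CameraX classes: `ToyRecorder`, `ToyPendingRecording`, and `ToyActiveRecording` are hypothetical stand-ins, and throwing an IllegalStateException on a second start is an assumption made for the model.

```kotlin
// Hypothetical stand-ins that mirror the roles described above; NOT the CameraX classes.
class ToyRecorder {
    private var active: ToyActiveRecording? = null

    // Preparing only captures configuration; nothing is recorded yet.
    fun prepareRecording(outputName: String) = ToyPendingRecording(this, outputName)

    fun start(pending: ToyPendingRecording): ToyActiveRecording {
        // Assumption for this model: a second concurrent recording is rejected.
        check(active == null) { "A recording is already in progress" }
        return ToyActiveRecording(this, pending.outputName, pending.audioEnabled)
            .also { active = it }
    }

    fun onStopped(recording: ToyActiveRecording) {
        if (active === recording) active = null
    }
}

class ToyPendingRecording(private val recorder: ToyRecorder, val outputName: String) {
    var audioEnabled = false
        private set

    fun withAudioEnabled() = apply { audioEnabled = true }

    // Only start() turns the configuration into a running recording.
    fun start(): ToyActiveRecording = recorder.start(this)
}

class ToyActiveRecording(
    private val recorder: ToyRecorder,
    val outputName: String,
    val audioEnabled: Boolean
) {
    fun stop() = recorder.onStopped(this)
}

fun main() {
    val recorder = ToyRecorder()
    val first = recorder.prepareRecording("a.mp4").withAudioEnabled().start()
    // Fails while the first recording is still active.
    println(runCatching { recorder.prepareRecording("b.mp4").start() }.isFailure)
    first.stop()
    // Succeeds once the first recording has been stopped.
    println(runCatching { recorder.prepareRecording("b.mp4").start() }.isSuccess)
}
```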

Let's look at these steps in more detail. First, the application configures the OutputOptions for a Recorder with Recorder.prepareRecording(). A Recorder supports the following types of OutputOptions:

  • FileDescriptorOutputOptions for capturing into a FileDescriptor.
  • FileOutputOptions for capturing into a File.
  • MediaStoreOutputOptions for capturing into a MediaStore.

All OutputOptions types enable you to set a maximum file size with setFileSizeLimit(). Other options are specific to the individual output type, such as ParcelFileDescriptor for the FileDescriptorOutputOptions.

prepareRecording() returns a PendingRecording object, an intermediate object that is used to configure audio and event listeners and to create the corresponding ActiveRecording object. PendingRecording is a transient class that should be invisible in most cases and is rarely cached by the app.

Applications can further configure the recording:

  • Enable audio with withAudioEnabled().
  • Register a listener to receive video recording events using withEventListener().

Finally, call PendingRecording.start() to turn the PendingRecording into an ActiveRecording. CameraX uses the ActiveRecording to start recording, sending a VideoRecordEvent.EVENT_TYPE_START event if the application has registered a callback for the events.

The following example shows how to record into a MediaStore:

// 1. Create MediaStoreOutputOptions for our recorder
val name = "CameraX-recording-" +
        SimpleDateFormat(FILENAME_FORMAT, Locale.US)
                .format(System.currentTimeMillis()) + ".mp4"
val contentValues = ContentValues().apply {
   put(MediaStore.Video.Media.DISPLAY_NAME, name)
}
val mediaStoreOutput = MediaStoreOutputOptions.Builder(this.contentResolver,
        MediaStore.Video.Media.EXTERNAL_CONTENT_URI)
    .setContentValues(contentValues)
    .build()

// 2. Configure Recorder and start recording to the mediaStoreOutput.
val activeRecording = videoCapture.output.prepareRecording(context, mediaStoreOutput)
   .withEventListener(ContextCompat.getMainExecutor(this), captureListener)
   .start()

Control an active recording

You can pause, resume, and stop an ongoing ActiveRecording by using the following methods:

  • pause() to pause the current active recording.
  • resume() to resume a paused active recording.
  • stop() to finish recording and flush any associated recording objects.

Note that you can call stop() to terminate an ActiveRecording regardless of whether the recording is in a paused or active recording state.
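The transitions described above can be summarized as a small state machine. The following plain-Kotlin sketch is a hypothetical model, not the CameraX implementation: `RecordingState` and `RecordingStateMachine` are invented names, and silently ignoring calls that do not apply to the current state is an assumption of the model rather than documented CameraX behavior.

```kotlin
// Hypothetical model of the recording states; NOT the CameraX implementation.
enum class RecordingState { RECORDING, PAUSED, STOPPED }

class RecordingStateMachine {
    var state = RecordingState.RECORDING
        private set

    // Pausing only has an effect while recording.
    fun pause() {
        if (state == RecordingState.RECORDING) state = RecordingState.PAUSED
    }

    // Resuming only has an effect while paused.
    fun resume() {
        if (state == RecordingState.PAUSED) state = RecordingState.RECORDING
    }

    // Stopping is valid from both RECORDING and PAUSED, and it is final.
    fun stop() {
        state = RecordingState.STOPPED
    }
}

fun main() {
    val machine = RecordingStateMachine()
    machine.pause()
    println(machine.state)
    machine.stop() // stop() works even though the recording is paused
    println(machine.state)
}
```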

If you've registered an EventListener with PendingRecording.withEventListener(), the ActiveRecording communicates by using VideoRecordEvent objects, including the following:

  • VideoRecordEvent.EVENT_TYPE_STATUS is used for recording statistics such as current file size and recorded time span.
  • VideoRecordEvent.EVENT_TYPE_FINALIZE is used for the recording result and includes information such as the URI of the final file along with any related errors.

Once your app receives an EVENT_TYPE_FINALIZE event that indicates a successful recording session, you can then access the captured video from the location specified in OutputOptions.
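The way a listener might branch on status and finalize events can be sketched in plain Kotlin. This is an illustrative model only: `ToyRecordEvent`, its fields, and `describe` are hypothetical and do not reflect the actual shape of VideoRecordEvent.

```kotlin
// Hypothetical event model that mirrors the two event types above; NOT the CameraX classes.
sealed class ToyRecordEvent {
    data class Status(val bytesRecorded: Long, val durationMs: Long) : ToyRecordEvent()
    data class Finalize(val outputUri: String, val error: String?) : ToyRecordEvent()
}

/** Turns an event into a short message that an app could show to the user. */
fun describe(event: ToyRecordEvent): String = when (event) {
    is ToyRecordEvent.Status ->
        "Recorded ${event.bytesRecorded} bytes in ${event.durationMs} ms"
    is ToyRecordEvent.Finalize ->
        // Only treat the output as usable when no error was reported.
        if (event.error == null) "Saved to ${event.outputUri}"
        else "Recording failed: ${event.error}"
}

fun main() {
    println(describe(ToyRecordEvent.Status(bytesRecorded = 2048, durationMs = 1500)))
    println(describe(ToyRecordEvent.Finalize(outputUri = "content://media/1", error = null)))
}
```

Checking the error before reading the output location reflects the point above: the captured video should be accessed only after a finalize event that indicates success.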

