1
0
mirror of synced 2025-12-20 10:32:35 -05:00
Files
airbyte/docs/integrations/sources/box-data-extract.md

5.3 KiB

Box Data Extract

The Box Data Extract Connector enables seamless data extraction from Box, allowing users to access file content from their Box cloud storage.

This connector helps automate workflows by using Box AI to extract data directly from documents stored in Box.

For example, a company managing lease contracts can automatically capture key details and populate their system of record. Likewise, a financial institution can extract critical data from loan application documents—such as bank statements and W-2s—and integrate it into approval workflows.

By prioritizing content intelligence, this connector unlocks new opportunities for automation and AI-driven insights.

This page contains the setup guide and reference information for the Box Data Extract source connector.

Prerequisites

You will need a Box application configured to use Client Credential Grants (CCG) Follow this guide to complete authentication.

From your box app configuration take note of:

  • Client ID: You Box App client ID. Find yours in the Box App configurations.
  • Client Secret: You Box App client secret.

Decide on what account is going to login to Box:

  • Box Subject Type: Represents the type of user to login as ("user" or "enterprise"). Enterprise will login with the application service account. User will login with the user if app can impersonate users.
  • Box Subject ID: If subject type is "enterprise", use your enterprise ID If subject type is "user", use the user id to login as.

Choose the which Box folder conatins the files you want to process:

  • Folder ID: Folder to retreive data from.
  • Recursive: Read the folders recursively.

If you are using Box AI you'll need:

  • Ask AI Prompt: If using the Ask AI, what prompt to send the AI about the document
  • Extract AI Prompt: If using the Extract AI, what prompt to send the AI about the document
  • Extract Structured AI Fields: If using the Extract Structured AI, what prompt to send the AI about the document. For more information on the expected format of this prompt check our developer documentation.

Setup guide

Set up Box Data Extract

For Airbyte Cloud:

  1. Log into your Airbyte Cloud account.
  2. Click Sources and then click + New source.
  3. On the Set up the source page, select Box Data Extract from the Source type dropdown.
  4. Enter a name for the Box Data Extract connector.
  5. Fill in the information:
    1. Client ID
    2. Client Secret
    3. Box Subject Type
    4. Box Subject ID
    5. Folder ID
    6. Recursive
  6. Click Setup source

For Airbyte Open Source:

  1. Navigate to the Airbyte Open Source dashboard.
  2. Click Sources and then click + New source.
  3. On the Set up the source page, select Box Data Extract from the Source type dropdown.
  4. Enter a name for the Box Data Extract connector.
  5. Fill in the information:
    1. Client ID
    2. Client Secret
    3. Box Subject Type
    4. Box Subject ID
    5. Folder ID
    6. Recursive
  6. Click Setup source

Supported sync modes

The Box Data Extract source connector supports the following sync modes:

Feature Supported?
Full Refresh Sync Yes
Incremental Sync No
SSL connection Yes
Namespaces No

Supported Streams

Changelog

Expand to review
Version Date Pull Request Subject
0.1.13 2025-05-10 59315 Update dependencies
0.1.12 2025-04-26 58737 Update dependencies
0.1.11 2025-04-19 58294 Update dependencies
0.1.10 2025-04-12 57611 Update dependencies
0.1.9 2025-04-05 57154 Update dependencies
0.1.8 2025-03-29 56593 Update dependencies
0.1.7 2025-02-13 Adding Box Extract Structured AI stream
0.1.6 2025-02-13 Adding Box Extract AI stream
0.1.5 2025-02-13 Adding Box Ask AI stream
0.1.4 2025-02-12 Initial release by @BoxDevRel