> ## Documentation Index
> Fetch the complete documentation index at: https://docs.bigdata.com/llms.txt
> Use this file to discover all available pages before exploring further.

# Upload your own content

**Bring your proprietary knowledge into BigData** (reports, research, internal docs, and more) and make it instantly searchable and actionable through **Search** and **Research Agent** services. Your content stays private and powers answers, summaries, and analysis.

<Check>
  Private & Secure: No LLM training on your data
</Check>

Upload and manage content via the **Content API**. All endpoints require an API key in the `X-API-KEY` header. See [Authentication](/api-rest/introduction) for details. For full request/response schemas, see the [Content API reference](/api-rest/content_introduction).

<Tip>
  For async ingestion (e.g. email inbox, SharePoint), see [Connectors](/api-rest/content_introduction#overview) in the Content API introduction.
</Tip>

***

## Upload a file

Direct upload has two required steps; a third step is optional if you want to track enrichment status. The workflow **uploads** your file, **enriches** it (extraction, structure and annotation of the content), and **indexes** it for availability in Search and Research Agent.

<Steps>
  <Step title="Request an upload URL">
    **POST** to `/contents/v1/documents` with JSON metadata. The response contains a single-use pre-signed **url** and the document **id**.

    ```bash theme={null}
    curl -X POST 'https://api.bigdata.com/contents/v1/documents' \
      -H 'X-API-KEY: YOUR_API_KEY' \
      -H 'Content-Type: application/json' \
      -d '{
        "file_name": "research_report.pdf",
        "published_ts": "2025-06-15T10:30:00Z",
        "tags": ["Research Team"],
        "share_with_org": true,
        "enrichments": ["reporting_entities"]
      }'
    ```

    Include an optional **`enrichments`** array to configure the processing pipeline to apply extra enrichment steps to the document. See [Enrichments](#enrichments) below.

    **Example response:**

    ```json theme={null}
    {
      "url": "https://s3.amazonaws.com/.../uploads/F22BC027BCE166BC89DD2A81358DA2F1?AWSAccessKeyId=...",
      "id": "F22BC027BCE166BC89DD2A81358DA2F1"
    }
    ```
  </Step>

  <Step title="Upload the file to the pre-signed URL">
    **PUT** the file to the **url** from the previous response. This uploads the document; Bigdata then enriches it (extraction, structure and annotation of the content) and indexes it for availability in Search and Research Agent.

    ```bash theme={null}
    curl -X PUT 'URL_FROM_RESPONSE' \
      --data-binary '@/path/to/research_report.pdf'
    ```
  </Step>

  <Step title="(Optional) Poll to track enrichment status">
    If you want to track when the document is ready, use the **id** from step 1 with [Get document metadata](/api-reference/documents/get-document-metadata) to poll `status`. While enriching, you will see `"status": "processing"`. When enrichment and indexing are complete, the status becomes `"status": "completed"`. This step is not required, Bigdata enriches and indexes the document in the background after you upload it.

    ```bash theme={null}
    # Use the id from the POST response (e.g. F22BC027BCE166BC89DD2A81358DA2F1)
    curl -X GET 'https://api.bigdata.com/contents/v1/documents/F22BC027BCE166BC89DD2A81358DA2F1' \
      -H 'X-API-KEY: YOUR_API_KEY'
    ```
  </Step>
</Steps>

Congratulations! Your file has been uploaded, enriched, and indexed and is available for the Search and Research Agent. Try the how-to guide [Upload and search your content](/how-to-guides/private-content/upload-and-search-your-content) to see it in action.

***

## Tags

**At upload time:** Include a `tags` array in the POST body to apply tag names to the document (e.g. `["Research Team", "Q4 2025"]`). These tags can be used to filter documents in the Search and Research Agent services.

**List tags:** Use [GET /contents/v1/tags](/api-reference/tags/list-tags) to retrieve all tags visible to your organization, with document counts. Tags are returned with `id`, `name`, and `file_count`.

**Filter documents by tags:** When calling [List documents](/api-reference/documents/list-documents), use the `tags` query parameter (repeat for multiple tags). Documents matching any of the given tags are returned (OR logic).

**Use tags in Search and Research Agent:** You can scope search and research to documents with specific tags. In the [Search API](/api-reference/search/search-documents#body-query-filters-tag), set `query.filters.tag` in the request body. In the [Research Agent API](/api-reference/research-agent/research-agent#body-tools-configs-one-of-0-search-query-filters-content-one-of-0-any-of-items-document-types-items-one-of-4-tags), configure the search tool's `query_filters.content` with the desired tag filter.

***

## Enrichments

Include an optional **`enrichments`** array in the upload POST body to configure the processing pipeline to apply additional enrichment steps to your document.

Supported enrichments:

| Enrichment           | Description                                                                                                                                                                                                                                                                                                 | Availability |
| -------------------- | ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | ------------ |
| `reporting_entities` | Identifies the document's reporting company — the entity a transcript or filing is reported by — consistent with the [`ReportingEntity` reporting detail](/getting-started/search/query_filters#reporting-details) used in Search, so you can reliably filter transcripts and filings by reporting company. | Available    |
| `translation`        | Translates the document content.                                                                                                                                                                                                                                                                            | Coming soon  |

```json theme={null}
{
  "file_name": "test.pdf",
  "published_ts": "2025-06-15T10:30:00Z",
  "tags": ["test"],
  "share_with_org": false,
  "enrichments": ["reporting_entities"]
}
```

<Tip>
  Enrichments can also be configured on a [connector](/api-rest/content_introduction#connectors) so they are applied to all content ingested through it. See [Create connector](/api-reference/connectors/create-connector).
</Tip>

***

## Working with your files

**List documents:** [GET /contents/v1/documents](/api-reference/documents/list-documents) returns a paginated list of documents you can access. Use query parameters for filtering (`origin`, `tags`, `connector`, `from_date`, `file_name`, etc.), sorting (`sort_by`, `sort_order`), and pagination (`page`, `page_size`). To list only documents uploaded via direct upload (POST /documents), use **`origin=file_upload`**. Each item includes `id` (content\_id), `file_name`, `status`, `connector_id`, `tags`, `request_origin`, and timestamps.

**Example: list first page of directly uploaded documents, filter by tag**

```bash theme={null}
curl -X GET 'https://api.bigdata.com/contents/v1/documents?origin=file_upload&page=1&page_size=10&tags=test' \
  -H 'X-API-KEY: YOUR_API_KEY'
```

**Example response (excerpt)**

```json theme={null}
{
  "results": [
    {
      "id": "7FA511999C3984CB75005890B15A7096",
      "file_name": "research_report.pdf",
      "user_id": "user_id_001",
      "org_id": "org_id_001",
      "rp_collection_id": null,
      "raw_size": 654,
      "request_origin": "file_upload",
      "content_type": "application/pdf",
      "status": "completed",
      "connector_id": null,
      "error_code": null,
      "delete_started_ts": null,
      "file_metadata": null,
      "tags": [
        {
          "id": "019a48b4-e574-71d9-a8d9-6a5a86386847",
          "name": "Research Team"
        }
      ]
    }
  ]
}
```

**Get document metadata:** [GET /contents/v1/documents/{content_id}](/api-reference/documents/get-document-metadata) returns metadata for one document by its content ID: status, file name, connector, tags, timestamps. Use this to check enrichment status before downloading content.

**Example: get metadata for one document**

```bash theme={null}
curl -X GET 'https://api.bigdata.com/contents/v1/documents/E0618D0E9D3A960C1731A620EDE56B5C' \
  -H 'X-API-KEY: YOUR_API_KEY'
```

**Private content only:** The annotated and original download paths below apply **only** to documents you ingested via the Content API (this guide). They do **not** work for document `id` values returned by the Search API for RavenPack content; for those, use **[Fetch document](/api-reference/search/fetch-document)** (`GET /v1/documents/{document_id}`).

**Download the original file:** [GET /contents/v1/documents/{content_id}/original](/api-reference/documents/get-original-document) returns a time-limited pre-signed URL. GET that URL to download the original file in its native format (e.g. .pdf, .docx).

**Example: get download URL for original file, then download**

```bash theme={null}
# 1. Get the pre-signed URL
curl -X GET 'https://api.bigdata.com/contents/v1/documents/E0618D0E9D3A960C1731A620EDE56B5C/original' \
  -H 'X-API-KEY: YOUR_API_KEY'

# 2. Response contains a URL; GET it to download the file (e.g. save to disk)
# curl -o research_report.pdf 'PRE_SIGNED_URL_FROM_RESPONSE'
```

**Download the annotated version:** [GET /contents/v1/documents/{content_id}/annotated](/api-reference/documents/get-annotated-document) returns a time-limited pre-signed URL. GET that URL to download the document as structured JSON (metadata, title, body blocks, entities, sentences with sentiment). Useful for search indexing, entity extraction, or structured display.

**Example: get download URL for annotated JSON**

```bash theme={null}
curl -X GET 'https://api.bigdata.com/contents/v1/documents/E0618D0E9D3A960C1731A620EDE56B5C/annotated' \
  -H 'X-API-KEY: YOUR_API_KEY'
# Then GET the returned URL to retrieve the JSON file.
```

***

## Deleting a document

To permanently remove a document from the platform, call [DELETE /contents/v1/documents/{content_id}](/api-reference/documents/delete-document). This deletes the original file, the annotated version, and the chunks from the vector database.

Only documents in **COMPLETED** or **FAILED** status can be deleted. Attempting to delete a document that is still being enriched will raise an exception. Use [Get document metadata](/api-reference/documents/get-document-metadata) to check `status` before deleting.

* **Success:** 200 response with a null body.
* **Not found:** 404 with body `{"statusCode": 404, "message": "Document not found", "errorCode": "PRIVATE_CONTENT_NOT_FOUND", "requestId": "..."}`.

<Warning>
  Deleting a document is permanent and cannot be undone.
</Warning>

***

## Next steps

* **Try the how-to guide** — [Upload and search your content](/how-to-guides/search_in_uploaded_files) to see your uploaded content in action.
* **Try the Batch file upload** — [Batch file upload](/how-to-guides/threading_upload_files) script for uploading many files.
* **Try [Playgrounds](https://platform.bigdata.com/)** — Use the Search Service or Research Agent playgrounds below; select **My Files** to query your documents.

<Columns cols={2}>
  <Card title="Search Service Playground" icon="magnifying-glass" href="https://platform.bigdata.com/search/search-documents" horizontal>
    Search across your private content and other sources. In the playground, open the source selector and choose **My Files** to limit results to your uploaded documents.
  </Card>

  <Card title="Research Agent Playground" icon="robot" href="https://platform.bigdata.com/research-agent" horizontal>
    Run research over your private content and real-time data. In the playground, use the source selector and filter by **My Files** to ground answers in your documents.
  </Card>
</Columns>

<Tip>
  **Related documentation**

  * [Content API introduction](/api-rest/content_introduction): Connectors vs direct upload, and document operations.
  * [Search API filters](../getting-started/search/query_filters#filetag): Use the tag and category filter to restrict queries to your uploaded files only. Or try it with the [Search API reference playground](/api-reference/search/search-documents#body-query-filters-category)
  * [Research Agent search tool filters](/api-reference/research-agent/research-agent#body-tools-configs-one-of-0-search): Restrict context to your proprietary content with [content filters](/api-reference/research-agent/research-agent#body-tools-configs-one-of-0-search-query-filters-content-one-of-0-any-of-items-document-types-items-one-of-4-tags) and [tags](/api-reference/research-agent/research-agent#body-tools-configs-one-of-0-search-query-filters-content-one-of-0-any-of-items-document-types-items-one-of-4-tags).
</Tip>