Analyze corporate exposure to specific themes and events, quantifying the impact for each company in your universe.
bigdata-research-tools
package, providing a deployable API for systematic thematic analysis.
The service uses the Bigdata API to analyze corporate exposure to specific themes using unstructured data from news, earnings calls, and regulatory filings. It combines LLM-powered theme taxonomies, semantic content retrieval, and structured scoring methodologies to transform narrative signals into quantifiable thematic exposure metrics.
If you prefer to work with the thematic screener as a Python package directly, you can explore the Thematic Screeners which provides comprehensive Jupyter notebook cookbooks and detailed examples.
http://localhost:8000/
and the documentation for the API at http://localhost:8000/docs
.
http://localhost:8000/
for a user-friendly dashboard that lets you run thematic screenings through a visual interfacehttp://localhost:8000/docs
for complete API documentation with interactive examplestheme
: The main theme, topic, or trend you want to screen for exposure. It can be specified as a single word or as a short sentence. The Screener will generate a list of sub-themes representing individual, self contained components of the main theme. The theme can contain multiple core concepts, but we would recommend not adding too many core concepts in the same screener run. (e.g., “Artificial Intelligence”, “Supply Chain Reshaping”, “Energy Transition”)focus
: Use this parameter to pass additional, custom instructions to the llm when breaking down the theme into sub-themes. These parameters allow you to guide the mindmap creation and customize it to your needs, as it allows users to inject their own domain knowledge, your specific point of view, and it will ensure that the mindmap will focus on the core concepts required.companies
: The portfolio of companies you want to screen for exposure, either as a list of RavenPack entity IDs representing individual companies ["4A6F00", "D8442A"]
or a watchlist ID "44118802-9104-4265-b97a-2e6d88d74893"
. Watchlists can be created programmatically using the Bigdata.com SDK or through the Bigdata appstart_date
/ end_date
: The start and end of the time sample during which you want to screen your portfolio for thematic exposure. The value has to be specified as a string in YYYY-MM-DD format.document_type
: The type of documents to search over. Use this parameter to point your screener to analyse text data extracted from news, corporate transcripts, or corporate filings. Currently, only supports “TRANSCRIPTS”.fiscal_year
: when screening for exposure in Transcripts and Filings, these documents can be further filtered by their reporting details. fiscal_year
represents the annual reporting period of the transcript and can be used in combination with start_date and end_date to further limit the queries to only those that are time sensitive from a calendar year and reporting period perspective. This parameter is not to be applied to News as news are not augmented with reporting metadata.frequency
: This parameter allows you to break down your sample range into higher frequency intervals. It can be useful when running a screener on a large sample, as the document_limit parameter will limit the ability of search to retrieve a representative sample of documents across many months. Instead of increasing the document limit, breaking down the creation of a large archive into smaller intervals will allow you to have more control over the retrieval process and obtain a more meaningful representation of exposure over time. The value must be one of: D
, Y
, M
, 3M
or Y
.http://localhost:8000/docs
.