Identifying Corporate Exposure to Risks
RiskAnalyzer
class, part of the bigdata-research-tools
package, is purpose-built to meet this challenge. Designed for risk analysts, portfolio managers, and investment professionals, it systematically analyzes corporate exposure to specific risk channels using unstructured data from news, earnings calls, and regulatory filings.
RiskAnalyzer
combines hybrid semantic search, risk factor taxonomies, and structured validation techniques to deliver:
README.md
.README.md
main_theme
): The risk scenario to analyze (e.g. US Import Tariffs against China)focus
): The analyst focus that provides an expert perspective on the scenario and helps break it down into risk factorscompanies
): The set of companies to screencontrol_entities
): The countries, people, or organizations that characterize the risk scenariokeywords
): The key concepts of the risk scenariostart_date
and end_date
): The date range over which to run the searchdocument_type
): Specify which documents to search over (transcripts, filings, news)fiscal_year
): If the document type is transcripts or filings, fiscal year needs to be specifiedsources
): Specify set of sources within a document type, for example which news outlets (available via Bigdata API) you wish to search overllm_model
): The AI model used for semantic analysisrerank_threshold
): By setting this value, you’re enabling the cross-encoder which reranks the results and selects those whose relevance is above the percentile you specify (0.7 being the 70th percentile). More information on the re-ranker can be found here.export_path
): The path to export the results in an Excel filefreq
): The frequency of the date ranges to search over. Supported values:
Y
: Yearly intervalsM
: Monthly intervalsW
: Weekly intervalsD
: Daily intervals.document_limit
): The maximum number of documents to return per query to Bigdata APIbatch_size
): The number of entities to include in a single batched querygenerate_results
will calculate the composite score, summing up the scores across the sub-scenarios for each company (df_company
) or industry (df_industry
) and add a global motivation statement (df_motivation
).