Credit Ratings Monitoring

This workflow demonstrates the capabilities of Bigdata to monitor specific events. In this example, we track credit ratings news for a Company and extract relevant features to track changes over time.

Why It Matters

Credit rating changes or outlook revisions can have immediate effects on corporate bond spreads, equity valuations, and counterparty risk assessments. For traders, portfolio managers, and credit analysts, staying ahead of these developments is critical to anticipate market reactions and adjust exposure before the news is fully priced in.

What It Does

This workflow systematically detects, labels, and summarizes event-related news for a selected watchlist of companies and entities using the Bigdata API for content retrieval and large language models for feature extraction. By customizing the prompts, keywords, and parameters, this framework can be adapted to monitor any type of corporate event or regulatory development - from credit ratings and earnings announcements to regulatory changes and strategic developments. The output includes both structured datasets and analytical reports for monitoring or backtesting.

How It Works

The workflow implements a four-step agentic pipeline built on Bigdata API:

Content Retrieval & Enhancement: Search for entities and event-specific keywords using the Bigdata Search API. Run queries over configurable time windows with parallel processing, collecting raw content with metadata and enriching results with surrounding context.
Feature Extraction & Validation: Use LLM-powered analysis to identify entity relationships, extract structured event features, and validate classifications. The prompts can be customized to extract any necessary features for different event types and domains.
Advanced Analytics Generation: Derive timestamped analytics with event-specific scoring and sentiment analysis tailored to your monitoring requirements.
Report Generation: Produce dated timelines of events with supporting quotes, source links, and exportable datasets for further analysis or integration into existing workflows.

A Real-World Use Case

This cookbook demonstrates the complete workflow through a practical example: tracking credit rating updates and outlook revisions for Tesla over a three-year period. You’ll learn how to transform unstructured rating-related news into structured insights that highlight rater-ratee relationships, analyst commentary, and market implications. Ready to get started? Let’s dive in!

Prerequisites

To run the Credit Ratings Monitoring workflow, you can choose between two options:

💻 GitHub cookbook
- Use this if you prefer working locally or in a custom environment.
- Follow the setup and execution instructions in the README.md.
- API keys are required:
  - Option 1: Follow the key setup process described in the README.md
  - Option 2: Refer to this guide: How to initialise environment variables
    - ❗ When using this method, you must manually add the OpenAI API key:
      # OpenAI credentials OPENAI_API_KEY = "<YOUR_OPENAI_API_KEY>"
🐳 Docker Installation
- Docker installation is available for containerized deployment.
- Provides an alternative setup method with containerized deployment, simplifying the environment configuration for those preferring Docker-based solutions.

Setup and Imports

Async Compatibility Setup

Run this cell first - Required for Google Colab, Jupyter Notebooks, and VS Code with Jupyter extension:

try:
    import asyncio
    asyncio.get_running_loop()
    import nest_asyncio; nest_asyncio.apply()
    print("✅ nest_asyncio applied")
except (RuntimeError, ImportError):
    print("✅ nest_asyncio not needed")

Below is the Python code required for setting up our environment and importing necessary libraries.

import pandas as pd
from bigdata_client import Bigdata
from bigdata_client.models.search import DocumentType
from src.knowledge_graph_manager import *
from src.search_enhanced import search_enhanced
from src.feature_extractor import FeatureExtractor
from src.feature_extractor import clean_credit_ratings_features_dataset
from src.summary_generator import SummaryGenerator
from src.visuals import ReportVisualizer

# Setup for Plotly
import plotly.io as pio
import plotly.graph_objects as go

try:
    import os
    if 'JUPYTERHUB_SERVICE_PREFIX' in os.environ or 'JPY_SESSION_NAME' in os.environ:
        pio.renderers.default = 'jupyterlab'
        print("✅ Plotly configured for JupyterLab")
    else:
        pio.renderers.default = 'plotly_mimetype+notebook'
        print("✅ Plotly configured for Jupyter/VS Code")
except:
    pio.renderers.default = 'notebook'
    print("✅ Plotly configured with fallback renderer")

interactive_plots = True  # Set to False to generate static plots

# Define output file paths for our results
output_dir = "report"
os.makedirs(output_dir, exist_ok=True)

export_path = f"{output_dir}/credit_ratings_monitor.csv"

Defining Your Event Monitoring Parameters

To perform an event monitoring and feature extraction analysis, you need to define a few key parameters:

Company Names (company_names): The set of companies to monitor events for (e.g. your portfolio or watchlist)
Rating Agencies Names (rating_agencies_names): (Optional) The list of entities you want to be co-mentioned with your entities and events
Keywords (keywords): The keywords characterizing the event
Time Period (start_date_query and end_date_query): The date range over which to run the search
Frequency (frequency): The frequency of the date ranges to search over. Supported values:
- Y: Yearly intervals.
- M: Monthly intervals.
- W: Weekly intervals.
- D: Daily intervals
Document Limit (document_limit): The maximum number of documents to return per query to Bigdata API.
Batch Size (batch_size): The number of entities to include in a single batched query.
Document Type (document_type): Specify which documents to search over (transcripts, filings, news)
Model Selection (llm_model): The AI model used for semantic analysis and topic classification

# ===== Context Definition ====
company_names = ['Tesla']
rating_agencies_names = ['S&P Global', 'Fitch Ratings Inc', "Moody's Corp"]
keywords = ['credit rating']  # Select the keyword related to the event you need to track

# ===== Specify Time Range =====
start_date_query = '2021-10-01'  # Start date
end_date_query = '2024-11-12'    # End date
frequency = 'D'

# ===== Query Configuration =====
document_limit = 100  # Maximum number of retrieved documents for each day
batch_size = 1        # number of companies to process in each batch
document_type = DocumentType.NEWS  # Scope of search

# ===== LLM Specification =====
model = 'openai::gpt-4o-mini'

Portfolio Selection

Define your watchlist starting from the companies name. For the purpose of this example, the workflow selects Tesla. The Entity ID is retrieved by leveraging Bigdata.com’s Knowledge Graph.

companies, full_company_names, company_objects = get_entity_ids(company_names)

In order to boost the accuracy of the retrieval process, the workflow includes a selection of control entities in the queries, that is, entities that have to appear alongside Tesla in the search results. For the purpose of this example, the workflow selects the list of Credit Rating Agencies (CRAs), such as S&P, Moody’s, and Fitch.

rating_agencies, full_agencies_names, rating_agency_companies = get_entity_ids(rating_agencies_names)

Content Retrieval from Bigdata Search API

The workflow searches news content using the Bigdata API to find articles mentioning Tesla, rating agencies, and credit rating keywords. The search runs across daily windows with parallel processing for efficiency. The search_enhanced function retrieves not only the matching text chunks but also their surrounding context (previous and next paragraphs) to provide richer information for analysis. Results are stored in the contextualized_chunks DataFrame for feature extraction.

contextualized_chunks = search_enhanced(
    companies=companies,
    keywords=keywords,
    sentences=None,
    control_entities=rating_agencies,
    start_date=start_date_query,
    end_date=end_date_query,
    scope=document_type,
    freq='D',
    document_limit=100,
    batch_size=1,
    enhance_search=True,
)

Features Augmentation

Entity Role Detection

The LLM agent is prompted to label the sentences extracted to identify the role played by the entities detected in each sentence, detecting raters and ratees. A unique identifier for the tuple of entity name, document headline, and augmented text is created and the content is sent to the LLM for role detection.

feature_extractor = FeatureExtractor(llm_model=model)

df_labeled = feature_extractor.assign_entity_roles(
    contextualized_chunks, 
    'contextualized_chunk_text', 
    ['entity_name', 'headline'], 
    action_type='detect'
)

Entity Role Validation

Subsequently, the LLM agent is prompted to validate the roles identified in the previous steps. The original text, the label assigned and the motivation are provided, and the LLM is instructed to either confirm or correct the role assigned to each entity.

df_labeled_valid = feature_extractor.assign_entity_roles(
    df_labeled, 
    'contextualized_chunk_text', 
    ['entity_name','headline', 'motivation','label'], 
    action_type='validate'
)

Credit Features Extraction

The LLM is instructed to augment the features related to long-term and short-term credit ratings, credit outlooks, and analysts’ comments in a multi-step approach leveraging three different prompts. The prompts can be customized to extract any necessary features. Prompt 1 is designed to extract the following:

Credit Rating: Extract the overall credit rating assigned to Tesla.
Credit Action: Extract any change or affirmation of the credit rating assigned to Tesla, categorized as:
- Upgrade: An improvement in the rating.
- Downgrade: A decrease in the rating.
- Affirmed: Rating confirmed with no change.
- Corrected: Adjusted due to an error.
- Withdrawn: The rating is removed.
- Reinstated: A withdrawn rating is restored.
Credit Status: Any additional information regarding the credit rating status, categorized as:
- Provisional Rating: A preliminary rating.
- Matured or Paid in Full: When the obligation reaches maturity.
- No Rating: Rating declined or unavailable.
- Published: Officially issued or announced.
Credit Outlook: The credit outlook mentioned by the rater, and any related mention of the credit rating assessed in the coming weeks, months or years.
- Positive: Suggests potential improvement.
- Negative: Indicates potential downgrade.
- Stable: No expected change.
- Developing: Change possible based on future events.
Credit Watchlist: Any mention of Tesla being placed in a credit watchlist for review of the credit rating. Labelled as:
- Watch: The rating is on a watchlist.
- Watch Positive: Potential upgrade.
- Watch Negative: Suggests downgrade.
- Watch Removed: No longer active.
- Watch Unchanged: Status remains without change in expectation.

df_labeled_valid['date'] = pd.to_datetime(df_labeled_valid['timestamp_utc']).dt.date

exploded_df = feature_extractor.group_text_and_labels(
    df_labeled_valid, 
    group_columns=['date', 'sentence_id', 'headline', 'source_name', 'contextualized_chunk_text', 'url'], 
    role_column='validated_label', 
    entity_column='entity_name'
)

features_extracted = {}

df_credit_ratings = feature_extractor.extract_single_feature(
    exploded_df, 
    feature_type='credit_ratings', 
    text_col='contextualized_chunk_text', 
    additional_prompt_fields=['rater_entity', 'ratee_entity', 'unclear_entities']
)
features_extracted['credit_ratings'] = df_credit_ratings

Prompt 2 is designed to extract the following:

Short Term Credit Rating: Any credit rating assigned to Tesla and specifically referred to a short-term debt instrument, if mentioned.
Long Term Credit Rating: Any credit rating assigned to Tesla and specifically referred to a long-term debt instrument, if mentioned.
Debt Instrument: The debt instruments under study.

df_debt_instruments = feature_extractor.extract_single_feature(
    exploded_df, 
    feature_type='debt_instruments', 
    text_col='contextualized_chunk_text', 
    additional_prompt_fields=['rater_entity', 'ratee_entity', 'unclear_entities']
)
features_extracted['debt_instruments'] = df_debt_instruments

Prompt 3 is designed to extract the following:

Key Drivers: Any motivating the credit rating or outlook decision, and influencing the credit quality of the ratee entity, including, but not limited to:
- Cash flow generation (e.g. earnings, revenues, dividends, assets)
- Insider trading, stock prices, stock picks
- Capital structure changes (e.g. equity actions, acquisitions, mergers)
Forward Guidance: Capture any forward guidance discussed regarding current or future credit ratings, including any potential changes or outlook updates.

df_drivers_guidance = feature_extractor.extract_single_feature(
    exploded_df, 
    feature_type='drivers_guidance', 
    text_col='contextualized_chunk_text', 
    additional_prompt_fields=['rater_entity', 'ratee_entity', 'unclear_entities']
)
features_extracted['drivers_guidance'] = df_drivers_guidance

# Combine all features
df_ext = feature_extractor._combine_features(exploded_df, features_extracted)

df_ext['ratee_entity_rp_entity_id'] = df_ext['ratee_entity'].map(
    dict(zip([company.name for company in company_objects], [company.id for company in company_objects]))
)
df_ext = df_ext.sort_values('date').reset_index(drop=True)

Deriving a Structured Dataframe of Advanced Analytics

The workflow provides a timestamped dataframe of credit ratings news with advanced analytics generated through the feature augmentation process. This dataset can be exported in CSV for Excel for further analysis, such as validation, augmentation, or backtesting.

df_clean = clean_credit_ratings_features_dataset(df_ext)

DataFrame Preview: 59 rows × 19 columns

View Full Dataset (First 5 Rows with All Columns)

#	Date	Sentence ID	Headline	Source	URL	Contextualized Chunk Text	Ratee Entity ID	Ratee Entity	Rater Entity	Credit Rating	Credit Outlook	Credit Action	Credit Status	Credit Watchlist	Short Term Rating	Long Term Rating	Debt Instrument	Forward Guidance	Key Drivers
0	2021-10-22	27780A0CE57AEE659F760AFD200B9758-1	S&P raises Tesla issuer credit rating to ‘BB+’ with positive outlook	The Fly	-	S&P Global Ratings raised its issuer credit and issue-level ratings on Tesla to ‘BB+’ with a positive outlook. The outlook reflects the view that Tesla’s free operating cash flow generation “will remain positive more consistently, even as the company expands its global manufacturing footprint over the next 12 months,” S&P said in a statement. Despite near-term supply bottlenecks for the industry, the firm expects Tesla deliveries and earnings “to remain strong over the next few quarters.”	DD3BB1	Tesla Inc.	S&P Global Inc.	BB+	Positive	Upgrade	-	-	-	BB+	-	S&P Global Ratings raised its issuer credit and issue-level ratings on Tesla to ‘BB+’ with a positive outlook, reflecting the view that Tesla’s free operating cash flow generation will remain positive more consistently over the next 12 months.	Positive free operating cash flow generation, Expansion of global manufacturing footprint, Strong expected deliveries and earnings over the next few quarters, Near-term supply bottlenecks for the industry
1	2021-10-22	8E35C0E1D412DDE5CD1C1943F1E474B4-1	S&P Global Ratings Upgrades Tesla With Positive Outlook	MT Newswires	-	S&P Global Ratings on Friday upgraded its issuer credit ratings of Tesla (TSLA) to BB+ with a positive outlook, citing “solid” demand prospects and “robust” financial metrics. The company’s free operating cash flow is expected to remain positive “more consistently,” S&P noted.	DD3BB1	Tesla Inc.	S&P Global Inc.	BB+	Positive	Upgrade	-	-	-	BB+	-	S&P Global Ratings upgraded its issuer credit ratings of Tesla to BB+ with a positive outlook.	solid demand prospects, robust financial metrics, positive free operating cash flow expected to remain more consistent
2	2021-11-09	0F0FFE7AEF0B8468B3285D1738A561D6-13	Seven Elon Musk Tweets That Sent Tesla Shares on a Wild Ride	Bloomberg News	Link	An April Fools’ joke that fell flat, Musk’s tweet that Tesla has gone “completely and totally bankrupt” came following a run of bad news for the automaker, including production shortfalls, regulatory scrutiny over its driver-assistance system Autopilot and a credit rating downgrade further into junk by Moody’s Investors Service.	DD3BB1	Tesla Inc.	Moody’s Corp.	junk	-	Downgrade	-	-	-	junk	-	-	production shortfalls, regulatory scrutiny over driver-assistance system Autopilot, credit rating downgrade further into junk by Moody’s Investors Service
3	2021-11-10	7D50135E60C2C1444CCBE211B498D62C-12	Seven Elon Musk tweets that sent Tesla shares on a wild ride	Gulf Business	Link	An April Fools’ joke that fell flat, Musk’s tweet that Tesla has gone “completely and totally bankrupt” came following a run of bad news for the automaker, including production shortfalls, regulatory scrutiny over its driver-assistance system Autopilot and a credit rating downgrade further into junk by Moody’s Investors Service.	DD3BB1	Tesla Inc.	Moody’s Corp.	junk	-	Downgrade	-	-	-	junk	-	-	production shortfalls, regulatory scrutiny over driver-assistance system Autopilot, credit rating downgrade further into junk
4	2022-01-25	FE7FB59DB58B3CBD9BEC38D0D088ACED-1	Tesla Inches Toward Blue-Chip Status With Moody’s Credit Upgrade	Bloomberg News	Link	Moody’s Investors Service Inc.’s move to ramp up Tesla Inc.’s credit rating to the cusp of investment grade is bolstering expectations that the famous electric vehicle maker will secure blue-chip status as soon as early next year. “Maintenance of current performance may be enough for additional Tesla upgrades at Moody’s,” he wrote in a note Tuesday.	DD3BB1	Tesla Inc.	Moody’s Corp.	Cusp of investment grade	-	Upgrade	-	-	-	cusp of investment grade	-	Expectations that Tesla will secure blue-chip status as soon as early next year; maintenance of current performance may be enough for additional upgrades at Moody’s.	current performance, credit rating increase to the cusp of investment grade

Report Generation

In this step, the workflow summarizes the timeline of credit ratings news. Summarization is performed in two steps, removing duplicates and repeated events by generating daily summaries and generating a timeline that highlights new information. Alongside the timeline of events, the workflow creates a final table which synthetizes the credit rating changes by rating agency, and visualizes it in an interactive plot.

summary_generator = SummaryGenerator(llm_model=model)

reports_dict = summary_generator.generate_report_by_entities(
    df=df_clean, 
    entity_keys=companies, 
    text_col='contextualized_chunk_text',
    fields_for_summary=['date', 'ratee_entity', 'headline','source_name','url','contextualized_chunk_text']
)

Save and Display Reports

Reports and tables can be customized and exported as HTML files for further analysis.

visualizer = ReportVisualizer(output_dir)

Tesla Inc.

company_report_text, company_table = reports_dict[f"{company_objects[0].id}"]

# Generate HTML report
html_content = visualizer.convert_text_to_html(
    entity_name=company_objects[0].name,
    start_date=start_date_query, 
    end_date=end_date_query,
    text=company_report_text
)

visualizer.display_html_report(html_content)

# Generate and save rating chart
fig = visualizer.plot_credit_ratings(df=company_table, save_path='credit_ratings_timeline')

Export the Results

Export the results for further analysis or to share with the team.

visualizer.save_html_report(html_content, company_objects[0].name, start_date_query, 
                            end_date_query, filename="credit_ratings_report.html")

# Save the final cleaned dataset with the extracted features
df_clean.to_csv(export_path)

Conclusion

The Credit Ratings Monitoring provides a comprehensive automated framework for tracking and analyzing credit rating events across your portfolio or watchlist. By systematically combining advanced information retrieval with LLM-powered feature extraction, this workflow transforms unstructured news data into actionable intelligence for credit analysis and risk management. Through the automated analysis of credit rating dynamics, you can:

Monitor rating changes in real-time - Stay ahead of upgrades, downgrades, and outlook revisions that could impact bond spreads and equity valuations
Extract structured insights - Transform narrative credit news into structured features including rating actions, outlooks, watchlist status, and key drivers
Generate actionable reports - Produce timestamped timelines and exportable datasets for backtesting, portfolio management, and regulatory compliance
Analyze market implications - Connect credit events to forward guidance and analyst commentary to anticipate market reactions

Whether you’re managing credit portfolios, conducting fixed income research, or monitoring counterparty risk, this analysis automates complex credit surveillance while delivering comprehensive insights. The multi-step feature extraction approach ensures that both explicit rating changes and subtle outlook shifts are captured systematically, making it essential for credit-focused investment strategies.

Cookbooks

Conversational AI

Search & Discovery

Market & Financial Analysis

Risk Management

Screening & Monitoring

Insights & Reporting

Why It Matters

What It Does

How It Works

A Real-World Use Case

Prerequisites

Setup and Imports

Async Compatibility Setup

Defining Your Event Monitoring Parameters

Portfolio Selection

Content Retrieval from Bigdata Search API

Features Augmentation

Entity Role Detection

Entity Role Validation

Credit Features Extraction

Deriving a Structured Dataframe of Advanced Analytics

Report Generation

Save and Display Reports

Tesla Inc.

Export the Results

Conclusion

Cookbooks

Conversational AI

Search & Discovery

Market & Financial Analysis

Risk Management

Screening & Monitoring

Insights & Reporting

​Why It Matters

​What It Does

​How It Works

​A Real-World Use Case

​Prerequisites

​Setup and Imports

​Async Compatibility Setup

​Defining Your Event Monitoring Parameters

​Portfolio Selection

​Content Retrieval from Bigdata Search API

​Features Augmentation

​Entity Role Detection

​Entity Role Validation

​Credit Features Extraction

​Deriving a Structured Dataframe of Advanced Analytics

​Report Generation

​Save and Display Reports

​Tesla Inc.

​Export the Results

​Conclusion

Why It Matters

What It Does

How It Works

A Real-World Use Case

Prerequisites

Setup and Imports

Async Compatibility Setup

Defining Your Event Monitoring Parameters

Portfolio Selection

Content Retrieval from Bigdata Search API

Features Augmentation

Entity Role Detection

Entity Role Validation

Credit Features Extraction

Deriving a Structured Dataframe of Advanced Analytics

Report Generation

Save and Display Reports

Tesla Inc.

Export the Results

Conclusion