Documentation Index
Fetch the complete documentation index at: https://docs.bigdata.com/llms.txt
Use this file to discover all available pages before exploring further.
Entity
A filter to match an entity by its “EntityID”. Utilize the methods provided in Knowledge Graph to identify entities/topics/sources of interest and use the obtained IDs to build queries. Example:Similarity
It calculates the embedding of the provided sentence in the Similarity filter and searches for the closest nodes in the proprietary Bigdata Vector Database. The following example searches for chunks closely related to the sentenceTariffs impacting US companies.
Keyword
We can enrich the query criteria with positive or negative Keyword filters. The keyword match is at the document title level or the chunk text. For instance, the following query will retrieve chunks that mention “Announcement” and “2024” but not “2023” in either the chunk or the document’s title. Example:The Keyword matching uses stemming, which means that the search will also
match similar words. For example, searching for “resignation” will
also match results containing the word “resignations”.
Topic
Bigdata identifies topics in the unstructured data so you can filter by them and find the text where those events have been identified. The Knowledge Graph defines 2.4k topics. The best way to get a list of relevant topics for your search is with the Co-mentions > Connected Topics method. You can also explore them in the Knowledge Graph > Find Topics page or send us an email at support@bigdata.com to request the whole taxonomy. Once you have the list of topic IDs you want to monitor, you can add them to the Search as a filter. Example:Source
Bigdata’s ecosystem comprises key high-quality content sources, including web content, premium news, press wires, call transcripts, and regulatory filings. You can focus your search on a list of trusted sources to minimize the noise and ensure novel information in your results. Example:SentimentRange
With Sentiment Ranges you can filter out document chunks by specifying a sentiment score range between -1.00 and +1.00. This score reflects the sentiment of each chunk based on the language used in every sentence. A score closer to -1.00 indicates negative sentiment, while a score closer to +1.00 indicates positive sentiment.The API support 3 values: Positive, Neutral and Negative.The Python SDK directly support numerical values for sentiment.
Document
Restrict the search to a list of specified documents. Use document IDs (e.g. from a previous search response) to search only within those documents. You can use the document filter in the API or the Document component in the Python SDK.- API
- Python SDK
In the Search API, set To exclude specific documents from the search, set
query.filters.document with mode (INCLUDE or EXCLUDE) and values (array of document IDs)."mode": "EXCLUDE" and provide the document IDs in values.Transcript
You can filter by a transcript subtype. The possible values are:ANALYST_INVESTOR_SHAREHOLDER_MEETING: Analyst, Investor and Shareholder meeting.CONFERENCE_CALL: General Conference Call.Coming SoonGENERAL_PRESENTATION: General Presentation.EARNINGS_CALL: Earnings Call.EARNINGS_RELEASE: Earnings Release.Coming SoonGUIDANCE_CALL: Guidance Call.SALES_REVENUE_CALL: Sales and Revenue Call.SALES_REVENUE_RELEASE: Sales and Revenue Release.Coming SoonSPECIAL_SITUATION_MA: Special Situation, M&A and Other.SHAREHOLDERS_MEETING: Shareholders Meeting.Coming SoonMANAGEMENT_PLAN_ANNOUNCEMENT: Management Plan Announcement.Coming SoonINVESTOR_CONFERENCE_CALL: Investor Conference Call.Coming Soon
The API still needs to support
SectionMetadataSectionMetadata: This filter allows querying for segments inside transcript documents. ADocumentChunkwill be defined by one or more sections, always within its hierarchical structure:QA: question and answer section. This section can be decomposed on:QUESTION: a question made during the session to a speaker.ANSWER: an answer from a speaker of the event.
MANAGEMENT_DISCUSSION: Management Discussion Section.
Filing
You can also query a specific Filing subtype. The possible values are:SEC_10_K: Annual report filing regarding a company’s financial performance submitted to the Securities and Exchange Commission (SEC).SEC_10_Q: Quarterly report filing regarding a company’s financial performance submitted to SEC.SEC_8_K: Report filed whenever a significant corporate event takes place that triggers a disclosure submitted to SEC.SEC_20_F: Annual report filing for non-U.S. and non-Canadian companies that have securities trading in the U.S.SEC_S_1: Filing needed to register the securities of companies that wish to go public with the U.S.SEC_S_3: Filing utilized when a company wishes to raise capital.SEC_6_K: Report of foreign private issuer pursuant to rules 13a-16 and 15d-16.SEC_DEF_14A: Definitive proxy statement the SEC requires before an annual meeting or shareholder vote; it includes material financial information and corporate governance details (for example, committee composition).
Investment Research
You can filter by investment research document subtypes. The possible values are:COMPANY_REPORT: Analysis of a single company’s financials, strategy, performance, including forecasts, valuation, and investment recommendations.COVERAGE_ANALYSIS: Document defining an analyst’s coverage universe by listing multiple companies or assets formally tracked on an ongoing basis.ECONOMIC_REPORT: Analysis of macroeconomic data (GDP, inflation, employment, central bank decisions) and forward-looking economic trends for countries or regions.FIXED_INCOME_REPORT: Analysis of debt securities like bonds or loans, covering performance, interest rates, risks, and investment recommendations.FUND_REPORT: Analytical report evaluating a fund’s performance, including attribution, transactions, risk metrics, and commentary on contributors/detractors.FX_AND_DERIVATIVES_REPORT: Research focused on derivative instruments and currency markets.GENERIC_REPORT: Miscellaneous research that cannot be assigned to other categories.INDEX_REPORT: Analysis of stock market index performance, composition, and outlook to guide investment or asset allocation decisions.INDUSTRY_REPORT: Analysis of an entire industry’s structure, trends, risks, and outlook to support investment decisions across companies in that sector.MARKET_UPDATE: Time-specific report summarizing recent market activity or price movements, often with tables or charts showing prices, returns, volume, or market breadth.PORTFOLIO_STRATEGY: Document outlining investment approach, objectives, asset allocation, equity strategies, and risk management plan for a portfolio.PORTFOLIO_SUMMARY: High-level snapshot of a portfolio showing current holdings, asset allocation, and key performance metrics.RATING_REPORT: Document assigning a buy, hold, or sell recommendation on a security, signaling structural changes like rating upgrades/downgrades or price target revisions.RESEARCH_NOTE: Brief, tactical commentary updating a broker’s investment view on a company, sector, or market event, typically triggered by specific news.THEMATIC_ANALYSIS: Analysis focused on a specific trend, sector, or investment theme, exploring drivers, opportunities, and potential impact on related companies or markets.
API
Reporting details
They help you to specify the period and and the reporting company.FiscalYear: Integer representing the annual reporting period.FiscalQuarter: Integer representing the fiscal quarter covered.ReportingEntity: Allows searching by the reporting company.
FileTag
You can filter by private uploaded documents that have specific tags. Use the tag filter in the API or FileTag in the Python SDK.- API
- Python SDK
In the Search API, set
query.filters.tag with an any_of array of tag names. Documents matching any of the specified tags are included.Query operators (SDK related)
The API requests can contain multiple filters; the SDK uses the Query operators to combine them. For example, you can combine different query filters with& (AND) | (OR) and ~ (NOT) operators.
All and Any. The first one is used to combine a list of
entities, keywords, topics, etc. with the AND operator, and the second
one is used to combine them with the OR operator. With the help from
Any the previous example would be rewritten as:
Document Version
Document Version are not yet supported in the API.
DocumentVersion for further details.
Watchlist
If you want to retrieve insights about any of the entities in a Watchlist, you can add all the entities in the query with aAny operator.
Watchlists are not yet supported in the API.