What data sources feed your AI models, and how often are they refreshed?
The vendor should publish a data-source matrix listing each provider, what it covers, and refresh latency.
Buyers need to know if the data is proprietary, third-party, or a mix — and the latency of updates. Decision criteria: source reliability, licensing cost coverage, refresh frequency, and asset-class fit.
What to ask for: a data-source matrix (one row per provider, columns for asset class, coverage, refresh interval). Compare it against your own pipeline so you know which sources are new vs. duplicated.
What a good answer looks like: equity tick data refreshed every 5 seconds; crypto books streamed continuously; macro indicators pulled hourly; news wires ingested in real-time with a clearly documented retention window.
Pulls per-asset-class: Yahoo + Alpha Vantage for equities, Binance + CoinGecko for crypto, CNN Fear & Greed for sentiment, FRED for macro, Edgar for SEC filings. Real-time price data refreshes every 5s on Ultimate / 8s on Pro. News wires + earnings + insider trades stream via the market-intelligence cron every 4 hours.
Read the docs