Data sources catalog

Verejný katalóg 8 data sources ktoré feedujú Entyrix DB. Každý riadok ukazuje cadence, formát, upstream URL, licenciu a freshness status. JSON / CSV variant pre automatickú konzumáciu: data-sources.json · data-sources.csv.

8 sources 0 total rows 0 enriched firm entries CC-BY 4.0 aggregate metadata
Country: ALL 🇸🇰 SK 🇨🇿 CZ 🇪🇺 EU · Category: enrichment ✕ clear

enrichment (8)

SourceCadenceFormátRowsEnriched firmsFreshnessLicence
🇸🇰 bi-ct-domains · BI-7 CT Log domain discovery
Certificate Transparency logs (CertSpotter primary, crt.sh fallback).
weekly api ? unknown public
🇸🇰 bi-cve-mapping · BI-6 CVE vulnerability mapping
NVD API lookup podľa tech+version pairs — 2.8k zraniteľných firiem.
weekly api ? unknown public-domain
🇸🇰 bi-dns-ssl-audit · BI-4 DNS/SSL audit
TLS cert, DNSSEC, SPF/DKIM/DMARC, HTTP sec headers, MX — 23.8k firiem.
weekly derived ? unknown derived
🇸🇰 bi-homepage-crawl · BI-2 Homepage crawl
Title, description, emails, phones, social links — 23.8k crawled.
weekly derived ? unknown derived
🇸🇰 bi-logo-extraction · BI-5 Logo extraction
Favicon + Open Graph image — 16k firiem.
weekly derived ? unknown derived
🇸🇰 bi-tech-detection · BI-3 Tech stack detection
40+ signatures (CMS, framework, server, CDN) — 21k firiem.
weekly derived ? unknown derived
🇸🇰 bi-website-discovery · BI-1 Website discovery
Heuristic domain probing, 23.8k firiem matched (50k/týž).
weekly derived ? unknown derived
🇸🇰 osm · OpenStreetMap POI (SK)
81k points of interest, 7k matched na companies cez website/operator/brand.
monthly api ? unknown ODbL

Freshness vyhodnotenie: ✓ fresh < 2× cadence, ⚠ behind 2-4× cadence, ✗ stale > 4× cadence · ? unknown = last_success not tracked · Licensing per upstream · Agg CC-BY 4.0 · Kontakt [email protected]