Performance Tuning

Use these steps to size hardware, tune configuration, and keep query latency predictable.

Tuning quick-start

Set concurrency: Align THREAD_COUNT with available CPU cores for write-heavy or analytical workloads.
Control cache churn: Increase CACHE_SIZE for highly diverse parameterized queries.
Bound work: Use TIMEOUT_DEFAULT and TIMEOUT_MAX to cap long-running queries; set MAX_QUEUED_QUERIES to protect memory under load.
Size results: Cap responses with RESULTSET_SIZE and QUERY_MEM_CAPACITY so runaway queries fail fast.
Profile before shipping: run GRAPH.PROFILE and GRAPH.EXPLAIN to validate query plans.

Prefer parameterized queries to maximize plan cache hit rate and reduce parse/plan overhead.
Add indexes before tuning hardware: see range indexes, full-text, and vector indexes.
Keep projections narrow: return only needed fields; paginate with SKIP/LIMIT.
Avoid Cartesian products: ensure patterns are selective and anchored with labels/properties.

Keep THREAD_COUNT near physical cores for balanced workloads; lower it if you see CPU saturation from many concurrent writes.
Increase MAX_QUEUED_QUERIES cautiously to avoid memory bloat; combine with timeouts to shed load gracefully.
For mixed workloads, reserve a dedicated FalkorDB instance for heavy analytics so OLTP queries stay predictable.

Tune RESULTSET_SIZE to prevent accidental full-graph scans from overwhelming clients.
For large bulk inserts, stage writes and batch in transactions rather than single huge queries to reduce peak memory.
Monitor memory after raising CACHE_SIZE; higher caches improve plan reuse but consume RAM.

Track query latency, queue depth, and timeout counts in your monitoring stack.
Re-run GRAPH.PROFILE after schema or index changes; plan shape and cost can shift when data distributions change.
Baseline throughput for representative datasets and queries, then document expected SLOs for teams.