Performance Guide¶

This guide summarizes benchmark behavior for pycubrid and shows practical tuning steps.

Table of Contents¶

Overview
Benchmark Results
Performance Characteristics
Optimization Tips
Performance Investigation
Timing & Profiling Hooks
Running Benchmarks

Overview¶

pycubrid is a pure Python DBAPI2 driver that talks to CUBRID over the CAS binary protocol.

mermaid flowchart LR App[Python Application] --> Driver[pycubrid\nPure Python DBAPI2] Driver --> CAS[CAS Binary Protocol over TCP] CAS --> Broker[CUBRID Broker / CAS] Broker --> Server[(CUBRID Server)]

mermaid flowchart TD Q[SQL + Parameters] --> Encode[Python object encoding] Encode --> Packet[CAS packet serialization] Packet --> Net[TCP round-trip] Net --> Exec[Server execution] Exec --> Decode[Row decoding to Python objects]

Benchmark Results¶

Source: cubrid-benchmark

Environment: Intel Core i5-9400F @ 2.90GHz, 6 cores, Linux x86_64, Docker containers.

Workload: Python pycubrid vs PyMySQL, 10000 rows x 5 rounds.

Scenario	CUBRID (pycubrid)	MySQL (PyMySQL)	Ratio (CUBRID/MySQL)
insert_sequential	10.47s	1.74s	6.0x
select_by_pk	15.99s	3.52s	4.5x
select_full_scan	10.31s	1.86s	5.5x
update_indexed	10.70s	2.19s	4.9x
delete_sequential	10.75s	2.10s	5.1x

Performance Characteristics¶

pycubrid is pure Python, so packet encode/decode and row conversion run in the interpreter.
PyMySQL can benefit from optional C acceleration in parts of the stack, which reduces CPU overhead.
CAS uses a binary protocol with explicit packet framing and parsing; this adds per-request work.
Small, chatty queries amplify Python-level and round-trip overhead.
Throughput improves when calls are batched and transaction boundaries are controlled.

Optimization Tips¶

Use explicit transactions for write bursts instead of per-statement commits.
Batch inserts and updates with executemany() where possible.
Reuse long-lived connections to avoid repeated handshake cost.
Select only required columns and avoid unnecessary full scans.
Keep hot predicates indexed and validate plans in CUBRID.

mermaid flowchart TD Start[Slow query path] --> Batching{Batchable workload?} Batching -->|Yes| ExecMany[Use executemany / multi-row patterns] Batching -->|No| Index{Index coverage good?} Index -->|No| AddIdx[Add or tune index] Index -->|Yes| Txn{Too many commits?} Txn -->|Yes| GroupTxn[Group statements in one transaction] Txn -->|No| Net[Profile network and CAS round-trips]

Performance Investigation¶

Use this workflow when cubrid-benchmark detects a measurable gap or regression. The goal is to reproduce, profile, fix, and verify — without hardcoding thresholds that age badly.

When to Investigate¶

A benchmark run shows a ratio increase vs the baseline recorded in this document.
A CI run flags a deviation from the previous run's numbers.
You are about to submit a change to the hot path (protocol.py, packet.py, cursor.py).

Workflow¶

mermaid flowchart TD Detect[cubrid-benchmark detects gap] --> Issue[File a Performance issue\nusing the issue template] Issue --> Profile[Run profiling scripts\nto isolate the hot path] Profile --> Optimize[Apply targeted fix\n(see Optimization Tips)] Optimize --> Verify[Re-run profiling scripts\nand cubrid-benchmark] Verify --> Close[Attach results to issue\nand close]

File an issue — use the Performance Investigation template. Paste the benchmark output and link the CI run that triggered this.
Profile the affected operation — pick the script that matches the slow operation:

Operation	Script
Connection handshake	`scripts/profile_connect.py`
INSERT / SELECT / UPDATE / DELETE	`scripts/profile_execute.py`
Row fetching (fetchone/fetchall/fetchmany)	`scripts/profile_fetch.py`

Optimise — guided by cProfile's cumulative time, focus changes on the top frames. Keep patches targeted; avoid speculative refactors.
Verify — re-run the profiling script and the full benchmark suite. Attach before/after numbers to the issue.

Running the Profiling Scripts¶

All scripts require a live CUBRID instance. Defaults target localhost:33000/demodb with user dba.

Connection handshake¶

# 100 connect/close cycles (default):
python scripts/profile_connect.py

# Custom target, 50 iterations, save .prof:
python scripts/profile_connect.py \
    --host myhost --port 33000 --database testdb \
    --user dba --password secret \
    --iterations 50 --output connect.prof

Statement execution¶

# All DML operations, 100 iterations each (default):
python scripts/profile_execute.py

# INSERT only, 200 iterations:
python scripts/profile_execute.py --operation insert --iterations 200

# Save .prof for snakeviz:
python scripts/profile_execute.py --output exec.prof

Result fetching¶

# 1000 rows, 50 fetch iterations (default):
python scripts/profile_fetch.py

# 5000 rows, 20 iterations, fetchmany batch size 100:
python scripts/profile_fetch.py --rows 5000 --iterations 20 --fetch-size 100

# Save .prof for snakeviz:
python scripts/profile_fetch.py --output fetch.prof

Visualising .prof files with snakeviz¶

pip install snakeviz
snakeviz profile_output.prof

snakeviz opens an interactive flame graph in the browser, making it easy to drill into nested call stacks.

Timing & Profiling Hooks¶

For lightweight in-process diagnosis you can opt into the driver's built-in timing instrumentation instead of running the cProfile-based scripts above. Hooks are off by default — when disabled the timing module is never imported and the hot path runs unchanged.

When to use which¶

Use case	Tool
"Where is wall-clock time going across `connect` / `execute` / `fetch` / `close` in my application?"	`enable_timing=True` (this section)
"Which Python frames inside `cursor.execute` are hot?"	`scripts/profile_execute.py` (cProfile)
"How does pycubrid compare to PyMySQL on a controlled workload?"	cubrid-benchmark

Enabling¶

Pass the enable_timing=True keyword to pycubrid.connect():

import pycubrid

conn = pycubrid.connect(
    host="localhost", port=33000, database="testdb", user="dba",
    enable_timing=True,
)

Or set the environment variable so timing is enabled for every connection in a process — useful in benchmark harnesses and CI jobs:

export PYCUBRID_ENABLE_TIMING=1   # also accepts true / yes (case-insensitive)
python my_workload.py

The explicit keyword always wins over the environment variable. Async connections support the same keyword on pycubrid.aio.connect().

Reading the stats¶

cur = conn.cursor()
cur.executemany(
    "INSERT INTO bench (n) VALUES (?)",
    [(i,) for i in range(1000)],
)
cur.execute("SELECT n FROM bench")
cur.fetchall()

stats = conn.timing_stats
print(stats)
# TimingStats(connect=1 calls, 12.345ms total, 12.345ms avg,
#             execute=2 calls, 18.700ms total, 9.350ms avg,
#             fetch=1 calls, 4.200ms total, 4.200ms avg,
#             close=0 calls)

# Programmatic access (nanoseconds, ints):
exec_avg_ms = stats.execute_total_ns / stats.execute_count / 1_000_000
print(f"average execute: {exec_avg_ms:.3f} ms")

# Reset between phases
stats.reset()

Connection.timing_stats is None when timing is disabled, so guard accordingly:

if conn.timing_stats is not None:
    print(conn.timing_stats)

Categories and granularity¶

Category	What it covers
`connect`	TCP socket setup + CAS broker handshake + database open. Recorded even on failure.
`execute`	`Cursor.execute()` and `executemany()` — wraps prepare-and-execute round-trip.
`fetch`	`fetchone()` / `fetchmany()` / `fetchall()` combined.
`close`	`Connection.close()` — `CloseDatabasePacket` round-trip + socket teardown.

All cursors created from a connection report into the same TimingStats. Stats are per-connection, cumulative since the last reset().

Overhead and thread-safety¶

Disabled — the timing module is not imported; per-call cost is a single attribute read (self._timing is None).
Enabled — two time.perf_counter_ns() calls plus a lock-protected accumulator update per hook (~hundreds of nanoseconds).
The threading.Lock inside TimingStats lets a monitoring thread safely read counters while a worker thread drives the connection. The connection itself remains threadsafety = 1 (one connection per thread).

Limitations¶

Async timings include event-loop scheduling latency; treat them as client-side end-to-end latency, not pure server time.
Counters are cumulative only — there is no per-statement history. If you need per-statement breakdowns, run the cProfile-based scripts in Performance Investigation.
ping() and commit() / rollback() are not currently timed.

Running Benchmarks¶

Clone the benchmark suite: git clone https://github.com/cubrid-lab/cubrid-benchmark.
Start benchmark containers and databases as documented in that repository.
Run the Python benchmark scenario (pycubrid vs PyMySQL) with the provided runner.
Execute multiple rounds (the published run used 10000 rows x 5 rounds).
Export and compare result artifacts (JSON/markdown tables) for trend analysis.

For exact commands and benchmark harness details, use the benchmark repo README and scripts.