Thread Pool Guide
This guide covers parallel log processing in Logly using thread pools, including configuration, task submission, batch processing, priority queues, work stealing, and best practices.
Overview
The thread pool module enables parallel log processing for high-throughput scenarios. It provides configurable worker threads, priority queues, work stealing, and parallel sink writing.
Logger Configuration
Configure thread pool settings through the logger's Config:
const logly = @import("logly");
var config = logly.Config.default();
config.thread_pool = .{
.enabled = true, // Enable thread pool
.thread_count = 8, // Number of worker threads
.queue_size = 2048, // Max queued tasks
.stack_size = 1024 * 1024, // 1MB per thread
.work_stealing = true, // Enable work stealing
};
// Or use helper method
var config2 = logly.Config.default().withThreadPool(.{ .thread_count = 4 });
Quick Start
const std = @import("std");
const logly = @import("logly");
pub fn main() !void {
var gpa = std.heap.GeneralPurposeAllocator(.{}){};
defer _ = gpa.deinit();
const allocator = gpa.allocator();
// Create thread pool with default settings
var pool = try logly.ThreadPool.init(allocator, .{
.thread_count = 4,
.work_stealing = true,
});
defer pool.deinit();
// Start workers
try pool.start();
defer pool.stop();
// Submit tasks
// pool.submit(...);
}
Configuration
Thread Count
// Specific number of threads
.thread_count = 8
// Auto-detect (0 = CPU cores)
.thread_count = 0
Queue Size
// Per-thread queue size
.queue_size = 1024
// Large queue for bursty workloads
.queue_size = 4096
Work Stealing
Enable threads to steal work from other threads' queues:
.work_stealing = true
This improves load balancing when some threads finish faster than others.
Arena Allocation
Enable per-worker arena allocation for efficient memory usage:
.enable_arena = true
When enabled, each worker thread maintains its own arena allocator. This is particularly useful for formatting operations, as it reduces contention on the global allocator and improves cache locality. The arena is automatically reset after each task.
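Conceptually, the pattern looks like the sketch below. This is an illustration of a per-worker arena using std.heap.ArenaAllocator, not Logly's actual internals; TaskQueue and workerLoop are hypothetical names.
// Per-worker arena sketch (hypothetical types, illustrative only)
fn workerLoop(backing: std.mem.Allocator, queue: *TaskQueue) void {
    var arena = std.heap.ArenaAllocator.init(backing);
    defer arena.deinit();
    while (queue.pop()) |task| {
        // The task would draw scratch memory (e.g., for formatting) from
        // arena.allocator() rather than the shared backing allocator.
        task.func(task.context);
        // Bulk-free the task's allocations; retain capacity for the next task.
        _ = arena.reset(.retain_capacity);
    }
}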
Priority Queues
Enable task prioritization:
.enable_priorities = true
Presets
Use built-in presets for common scenarios:
// Single thread (for testing/debugging)
const single = logly.ThreadPoolPresets.singleThread();
// CPU-bound tasks (N threads, work stealing)
const cpu = logly.ThreadPoolPresets.cpuBound();
// I/O-bound tasks (2N threads, large queues)
const io = logly.ThreadPoolPresets.ioBound();
// Maximum throughput
const high = logly.ThreadPoolPresets.highThroughput();
Submitting Tasks
Basic Task Submission
try pool.submit(.{
.func = myTaskFunction,
.context = @ptrCast(&myData),
.priority = .normal,
.submitted_at = std.time.milliTimestamp(),
});
fn myTaskFunction(ctx: *anyopaque) void {
const data: *MyData = @alignCast(@ptrCast(ctx));
// Process data...
}
Priority Levels
pub const TaskPriority = enum(u8) {
low = 0, // Background tasks
normal = 1, // Regular logging
high = 2, // Important logs
critical = 3, // Error/alert logs
};
Submit with Priority
// Standard priority submission
_ = pool.submitCallback(myFunction, @ptrCast(&data));
// High priority (processed before normal)
_ = pool.submitHighPriority(myFunction, @ptrCast(&data));
// Critical priority (processed first)
_ = pool.submitCritical(myFunction, @ptrCast(&data));
Batch Submission
Submit multiple tasks at once for higher throughput (single lock acquisition):
// Create array of tasks
var tasks: [10]logly.ThreadPool.Task = undefined;
for (&tasks) |*task| {
task.* = .{ .callback = .{ .func = myFunction, .context = context } };
}
// Batch submit - returns number of successfully submitted tasks
const submitted = pool.submitBatch(&tasks, .normal);
std.debug.print("Submitted {} tasks\n", .{submitted});Non-Blocking Submission ​
Use trySubmit for low-latency scenarios where blocking is unacceptable:
// Returns immediately if lock is contended
if (pool.trySubmit(task, .high)) {
// Task submitted successfully
} else {
// Queue is contended or full, handle fallback
handleFallback(task);
}
Worker Affinity
Submit to a specific worker's local queue for better cache locality:
// Submit to worker 0's local queue
_ = pool.submitToWorker(0, task, .normal);
// Submit to worker 1's local queue
_ = pool.submitToWorker(1, task, .normal);
This is useful when tasks need to access the same data, as keeping them on the same worker improves CPU cache hit rates.
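For example, tasks that share data can be pinned to one worker by hashing a common key into a worker index. A sketch (request_id is a hypothetical key; threadCount() is the accessor listed under Method Aliases below):
// Route tasks that share data to the same worker (illustrative)
const h = std.hash.Wyhash.hash(0, request_id); // request_id: []const u8
const worker_index: usize = @intCast(h % pool.threadCount());
_ = pool.submitToWorker(worker_index, task, .normal);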
Parallel Sink Writing
Write to multiple sinks concurrently with the enhanced ParallelSinkWriter:
// Create with default config
var writer = try logly.ParallelSinkWriter.init(allocator, pool);
defer writer.deinit();
// Or with a custom ParallelConfig (shown as writer2 so both examples coexist)
var writer2 = try logly.ParallelSinkWriter.initWithConfig(allocator, pool, .{
.max_concurrent = 4,
.retry_on_failure = true,
.max_retries = 3,
.buffered = true,
.buffer_size = 64,
});
defer writer2.deinit();
// Add sinks
try writer.addSink(.{
.write_fn = &fileWriteFn,
.flush_fn = &fileFlushFn,
.name = "file",
});
try writer.addSink(.{
.write_fn = &consoleWriteFn,
.name = "console",
});
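// Note: fileWriteFn, fileFlushFn, and consoleWriteFn above are user-supplied
// callbacks, not functions shipped with Logly; their exact signature is an
// assumption here. A minimal sketch of one:
//
//   fn consoleWriteFn(data: []const u8) anyerror!void {
//       try std.io.getStdOut().writeAll(data);
//   }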
// Write to all sinks in parallel
writer.writeParallel("Log message data");
// Or use alias
writer.write("Log message data");
// Flush all buffered writes
writer.flushAll();
Configuration Options
pub const ParallelConfig = struct {
max_concurrent: usize = 8, // Max parallel writes
write_timeout_ms: u64 = 1000, // Timeout per write
retry_on_failure: bool = true, // Retry failed writes
max_retries: u3 = 3, // Max retry attempts
fail_fast: bool = false, // Stop on first error
buffered: bool = true, // Buffer before dispatch
buffer_size: usize = 64, // Buffer size
};
ParallelConfig Presets
// High throughput - max concurrent, large buffers
const high_throughput = logly.ParallelConfig.highThroughput();
// Low latency - small buffers, no retry, fail-fast
const low_latency = logly.ParallelConfig.lowLatency();
// Reliable - more retries, longer timeouts
const reliable = logly.ParallelConfig.reliable();
Sink Management
// Remove a sink by name
writer.removeSink("console");
// Disable a sink temporarily
writer.setSinkEnabled("file", false);
// Re-enable
writer.setSinkEnabled("file", true);
// Check if any sinks are enabled
if (writer.hasEnabledSinks()) {
writer.write(data);
}
// Get sink count
const count = writer.sinkCount();
Parallel Write Statistics
const stats = writer.getStats();
std.debug.print("Writes submitted: {d}\n", .{
stats.writes_submitted.load(.monotonic),
});
std.debug.print("Writes completed: {d}\n", .{
stats.writes_completed.load(.monotonic),
});
std.debug.print("Writes failed: {d}\n", .{
stats.writes_failed.load(.monotonic),
});
std.debug.print("Retries: {d}\n", .{
stats.retries.load(.monotonic),
});
std.debug.print("Bytes written: {d}\n", .{
stats.bytes_written.load(.monotonic),
});
std.debug.print("Success rate: {d:.2}%\n", .{
stats.successRate() * 100,
});
Statistics
Monitor pool performance:
const stats = pool.getStats();
std.debug.print("Tasks completed: {d}\n", .{
stats.tasks_completed.load(.monotonic),
});
std.debug.print("Tasks stolen: {d}\n", .{
stats.tasks_stolen.load(.monotonic),
});
std.debug.print("Throughput: {d:.2} tasks/sec\n", .{
stats.throughput(),
});
std.debug.print("Avg wait time: {d}ns\n", .{
stats.averageWaitTimeNs(),
});
std.debug.print("Avg exec time: {d}ns\n", .{
stats.averageExecTimeNs(),
});
Use Cases
1. High-Volume Logging
// Use high throughput preset
var pool = try logly.ThreadPool.init(
allocator,
logly.ThreadPoolPresets.highThroughput(),
);
2. Multiple Log Destinations
// Write to file, console, and network simultaneously
var writer = try logly.ParallelSinkWriter.initWithConfig(allocator, pool, .{
.max_concurrent = 3,
});
try writer.addSink(&file_sink);
try writer.addSink(&console_sink);
try writer.addSink(&network_sink);
3. Batch Processing
// Process log batches in parallel
for (log_batches) |batch| {
try pool.submit(.{
.func = processBatch,
.context = @ptrCast(&batch),
});
}
pool.waitIdle();
4. Priority-Based Logging
// Critical logs get processed first
try pool.submitWithPriority(logError, &error_data, .critical);
// Regular logs processed normally
try pool.submitWithPriority(logInfo, &info_data, .normal);
// Debug logs processed last
try pool.submitWithPriority(logDebug, &debug_data, .low);
Work Stealing
Work stealing improves efficiency when:
- Tasks have variable execution times
- Some threads finish faster than others
- You want better CPU utilization
var pool = try logly.ThreadPool.init(allocator, .{
.work_stealing = true, // Enable work stealing
.thread_count = 8,
});
How It Works
- Each thread has its own work queue
- When a thread's queue is empty, it "steals" from others
- Stealing is done from the back of other queues
- This balances work across all threads (see the sketch below)
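A conceptual model of that loop is sketched below. Worker, Pool, popFront, and popBack are hypothetical names; this illustrates the stealing order, not Logly's actual implementation.
fn workerLoop(self: *Worker, pool: *Pool) void {
    while (pool.running.load(.acquire)) {
        // Prefer the worker's own queue first.
        if (self.local.popFront()) |task| {
            task.func(task.context);
            continue;
        }
        // Local queue empty: steal from the back of a sibling's queue.
        for (pool.workers) |*other| {
            if (other == self) continue;
            if (other.local.popBack()) |stolen| {
                stolen.func(stolen.context);
                break;
            }
        }
    }
}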
Integration with Async Logger
Combine thread pools with async logging:
var async_logger = try logly.AsyncLogger.init(allocator, .{
.buffer_size = 8192,
});
var pool = try logly.ThreadPool.init(allocator, .{
.thread_count = 4,
});
// Use pool for parallel sink writing
var parallel_writer = try logly.ParallelSinkWriter.init(allocator, pool);
// Connect components...
Best Practices
1. Choose Appropriate Thread Count
// CPU-bound: Use CPU core count
.thread_count = std.Thread.getCpuCount() catch 4
// I/O-bound: Use 2x CPU cores
.thread_count = (std.Thread.getCpuCount() catch 4) * 2
2. Size Queues Appropriately
// For bursty workloads, use larger queues
.queue_size = 4096
// For steady workloads, smaller is fine
.queue_size = 256
3. Handle Queue Full
pool.submit(task) catch |err| {
if (err == error.QueueFull) {
// Handle backpressure
// - Wait and retry (sketched below)
// - Drop the task
// - Expand queue
}
};
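A minimal wait-and-retry sketch for the first strategy (the retry count and backoff interval are illustrative; assumes submit reports error.QueueFull as above):
var attempts: usize = 0;
while (attempts < 3) : (attempts += 1) {
    pool.submit(task) catch |err| switch (err) {
        error.QueueFull => {
            // Brief backoff before retrying on backpressure
            std.Thread.sleep(1 * std.time.ns_per_ms);
            continue;
        },
        else => return err,
    };
    break; // Submitted successfully
}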
4. Graceful Shutdown
// Stop accepting new tasks
pool.stop();
// Or with timeout
pool.stopWithTimeout(5000); // 5 second timeout
5. Monitor Performance
Regularly check statistics to identify bottlenecks:
const stats = pool.getStats();
if (stats.tasks_dropped.load(.monotonic) > 0) {
// Queue overflow - increase queue size or threads
}
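A health check can also combine drop counts with utilization() (see New Methods below); the 0.9 threshold here is illustrative:
if (pool.utilization() > 0.9 and stats.tasks_dropped.load(.monotonic) > 0) {
    // Pool is saturated: add worker threads or enlarge the queues
    std.debug.print("pool saturated: {d} tasks dropped\n", .{
        stats.tasks_dropped.load(.monotonic),
    });
}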
Error Handling
pool.submit(task) catch |err| {
switch (err) {
error.PoolNotRunning => {
// Pool hasn't started or has stopped
},
error.QueueFull => {
// All queues are full
},
error.OutOfMemory => {
// Memory allocation failed
},
}
};
Performance Considerations
- Thread overhead: each worker thread reserves its own stack (1MB with the default stack_size shown above)
- Context switching: Too many threads can cause overhead
- Cache locality: Work stealing may affect cache performance
- Lock contention: Minimize shared state between tasks
Example: Production Setup
const std = @import("std");
const logly = @import("logly");
pub fn main() !void {
var gpa = std.heap.GeneralPurposeAllocator(.{}){};
defer _ = gpa.deinit();
const allocator = gpa.allocator();
// Production thread pool config
const cpu_count = std.Thread.getCpuCount() catch 4;
var pool = try logly.ThreadPool.init(allocator, .{
.thread_count = cpu_count,
.queue_size = 2048,
.work_stealing = true,
.enable_priorities = true,
.shutdown_timeout_ms = 10000,
});
defer pool.deinit();
// Parallel sink writer
var writer = try logly.ParallelSinkWriter.initWithConfig(allocator, pool, .{
.max_concurrent = 4,
.retry_on_failure = true,
.max_retries = 3,
});
defer writer.deinit();
try pool.start();
defer pool.stop();
// Application runs...
// Check final stats
const stats = pool.getStats();
std.debug.print("Total processed: {d}\n", .{
stats.tasks_completed.load(.monotonic),
});
}
Method Aliases
The ThreadPool provides convenient aliases for common operations:
| Method | Alias | Description |
|---|---|---|
| waitAll() | await(), join() | Wait for all tasks to complete |
| submit() | push(), enqueue() | Submit a task |
| submitFn() | run() | Submit a function |
| pendingTasks() | queueDepth(), size() | Get pending task count |
| activeThreads() | workerCount() | Get active thread count |
| clear() | discard() | Clear pending tasks |
// These are equivalent:
pool.waitAll();
pool.await();
pool.join();
// Submit aliases:
pool.submit(task, .normal);
pool.push(task, .normal);
pool.enqueue(task, .normal);
// Check status:
const pending = pool.size(); // Same as pendingTasks()
const workers = pool.workerCount(); // Same as activeThreads()
// Control:
pool.clear(); // Discard pending tasks
pool.discard(); // Same as clear()
// Status checks:
if (pool.isRunning()) { ... }
const total = pool.threadCount();
New Methods (v0.0.9)
var pool = try logly.ThreadPool.init(allocator, config);
defer pool.deinit();
// State methods
const empty = pool.isEmpty();
const full = pool.isFull();
// Performance metrics
const util = pool.utilization(); // 0.0 - 1.0
// Reset statistics
pool.resetStats();
Additional Aliases
| Alias | Method |
|---|---|
| flush | clear |
| statistics | getStats |
| stop | shutdown |
| halt | shutdown |
| begin | start |
| add | submit |
