
Effective Debugging Techniques in Python: Tools and Strategies for Developers
Debugging is an essential skill for every Python developer. This guide walks intermediate developers through proven tools, strategies, and real-world examples — from simple print-to-logging upgrades to interactive debuggers, async debugging, profiling, and memory inspection. Learn how to debug faster, write more reliable scripts, and ship confident Python code.
Introduction
Why do bugs linger? Because we often treat debugging as an afterthought instead of a skill. Effective debugging is a mix of detective work, tooling, and disciplined code design. This post teaches practical debugging techniques in Python — from quick inspections to advanced profiling — so you can locate, understand, and fix problems faster.
You'll learn:
- Key debugging concepts and prerequisites
- Hands-on examples (pdb, logging, pytest, async/await debugging)
- Profiling and memory analysis
- Strategies for debugging automations and dashboards (including Plotly)
- Best practices, common pitfalls, and advanced tips
Prerequisites
Before debugging effectively, ensure you have:
- Python 3.x installed (3.8+ recommended)
- Basic knowledge of functions, exceptions, and modules
- Familiarity with the terminal and a code editor (VS Code, PyCharm, or similar)
- Optional but useful: pytest (for tests), ipython/ipdb, and basic understanding of asyncio for async code
Core Concepts
Break the debugging workflow into approachable steps:
- Reproduce the bug reliably — create the smallest test case.
- Gather information — logs, stack traces, and failure conditions.
- Hypothesize the cause — isolate variables and functions.
- Verify with controlled experiments — unit tests or small scripts.
- Fix and write regression tests.
The main tool categories covered in this guide:
- print() vs logging
- Interactive debuggers (pdb, ipdb, pudb)
- IDE debuggers (breakpoints, watch expressions)
- Unit tests and test runners (pytest)
- Profilers (cProfile, timeit)
- Memory tools (tracemalloc, memory_profiler)
- Async-aware debugging (asyncio, await inspection)
- Post-mortem debugging and core dumps
Why not just print()? — print vs logging
Many developers reach for print(). It's quick, but it has limitations:
- No levels (INFO/DEBUG/WARNING/ERROR)
- Hard to disable in production
- No timestamps or context
Example: upgrading prints to logging
# debug_logging_example.py
import logging

logging.basicConfig(
    level=logging.DEBUG,
    format="%(asctime)s %(levelname)s %(name)s: %(message)s"
)
logger = logging.getLogger("example")

def divide(a, b):
    logger.debug("divide called with a=%r, b=%r", a, b)
    if b == 0:
        logger.error("Attempt to divide by zero: a=%r, b=%r", a, b)
        raise ValueError("b must not be zero")
    result = a / b
    logger.info("division result: %r", result)
    return result

if __name__ == "__main__":
    try:
        divide(10, 0)
    except Exception:
        logger.exception("Unhandled exception")
Line-by-line:
- import logging: import the standard library logging module.
- basicConfig(...): configure global logging level and format.
- getLogger("example"): create a named logger.
- logger.debug(...): log debug details (only shown if level <= DEBUG).
- logger.error(...): log an error condition with structured data.
- logger.exception(...): logs traceback; use in an exception handler.
- Input: divide(10, 0) triggers a ValueError.
- Output: structured logs with timestamp and trace.
Caveats:
- In libraries, avoid basicConfig() — expect the application to configure logging.
- For multiprocessing or async apps, use appropriate handlers (QueueHandler, etc.).
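To illustrate the library caveat above, a library module (here a hypothetical mylib) can attach a NullHandler so it stays silent unless the application configures logging:

```python
# mylib.py (illustrative library module)
import logging

# A NullHandler swallows records, so importing the library never prints
# "No handlers could be found" warnings or writes unexpected output;
# the application decides whether and how to log.
logger = logging.getLogger("mylib")
logger.addHandler(logging.NullHandler())

def do_work(x):
    logger.debug("do_work called with x=%r", x)
    return x * 2
```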
Interactive Debuggers: pdb, ipdb, pudb
Interactive debuggers let you pause execution, inspect variables, and step through code.
Minimal example with pdb:
# pdb_example.py
def greet(name):
    message = f"Hello, {name.upper()}"
    return message

def main():
    name = "world"
    import pdb; pdb.set_trace()  # breakpoint
    print(greet(name))

if __name__ == "__main__":
    main()
How to use:
- Run python pdb_example.py.
- When execution stops, use commands: n (next), s (step), c (continue), p variable (print), l (list source).
- import pdb; pdb.set_trace(): sets a breakpoint and enters the interactive prompt.
- p variable: inspect variables.
- n/s/c: navigate code execution.
Pro tip: instead of pdb.set_trace(), use the built-in breakpoint() (Python 3.7+), which respects the PYTHONBREAKPOINT environment variable.
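A small sketch of using breakpoint() conditionally (the function is illustrative):

```python
# conditional_breakpoint.py (illustrative)
def risky_divide(a, b):
    if b == 0:
        # breakpoint() calls sys.breakpointhook(). Set PYTHONBREAKPOINT=0
        # to disable every breakpoint() call, or
        # PYTHONBREAKPOINT=ipdb.set_trace to swap in ipdb.
        breakpoint()
    return a / b

print(risky_divide(10, 2))  # b != 0, so the breakpoint branch is skipped
```

Because the hook is controlled by an environment variable, you can leave conditional breakpoints in place and toggle them per run without editing the code.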
Unit Tests and Debugging with pytest
Writing small unit tests helps reproduce and prevent regressions.
Example: test-driven debugging for the divide function.
# test_divide.py
import pytest
from debug_logging_example import divide

def test_divide_normal():
    assert divide(6, 3) == 2

def test_divide_zero():
    with pytest.raises(ValueError):
        divide(1, 0)
Run:
- pytest -q
- Tests act as reproducible minimal cases.
- Integration into CI prevents regressions.
Debugging Asynchronous Code: async and await
Async code introduces concurrency complexities. Common issues: tasks not awaited, race conditions, or event loop misuse.
Example: a simple async bug (forgetting to await)
# async_bug.py
import asyncio

async def slow_double(x):
    await asyncio.sleep(0.1)
    return x * 2

async def main():
    # Bug: missing await leaves coroutine objects, not results.
    results = [slow_double(i) for i in range(3)]
    print("results:", results)  # prints coroutine objects

    # Correct approach: unpack the coroutines into gather and await it.
    awaited = await asyncio.gather(*(slow_double(i) for i in range(3)))
    print("awaited:", awaited)

if __name__ == "__main__":
    asyncio.run(main())
Line-by-line:
- slow_double: async function that sleeps and returns doubled value.
- results = [slow_double(i) for i in range(3)]: creates coroutine objects; not executed.
- asyncio.gather + await: runs coroutines concurrently and awaits results.
Tips:
- Use logging inside coroutines to understand scheduling.
- Use asyncio.run, not loop.run_until_complete in simple scripts.
- Use pytest-asyncio or pytest-aiohttp for testing async functions.
- For live debugging, use aiomonitor or asyncio.Task.get_stack to inspect tasks.
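Building on the tips above, one way to make sporadic task failures visible is asyncio.gather(..., return_exceptions=True), which returns exceptions as values instead of letting the first one propagate (a small sketch; maybe_fail is illustrative):

```python
# async_errors.py (illustrative)
import asyncio

async def maybe_fail(x):
    await asyncio.sleep(0)
    if x == 1:
        raise RuntimeError(f"task {x} failed")
    return x * 2

async def main():
    # return_exceptions=True collects each task's outcome, exception or
    # value, so one failing task cannot hide the results of the others.
    results = await asyncio.gather(
        *(maybe_fail(i) for i in range(3)), return_exceptions=True
    )
    for i, r in enumerate(results):
        if isinstance(r, Exception):
            print(f"task {i} raised: {r!r}")
    return results

if __name__ == "__main__":
    asyncio.run(main())
```

Inspecting the returned list in a debugger or log line is often enough to pinpoint which task fails and why.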
Profiling for Performance (CPU & Time)
When bugs are performance-related, use profilers.
Simple usage with cProfile and pstats:
# profile_example.py
import cProfile
import pstats
from io import StringIO
from sample_module import heavy_work # imagine a CPU-heavy function
pr = cProfile.Profile()
pr.enable()
heavy_work()
pr.disable()
s = StringIO()
ps = pstats.Stats(pr, stream=s).sort_stats("cumtime")
ps.print_stats(20)
print(s.getvalue())
This shows functions consuming the most cumulative time.
Use timeit for microbenchmarks and line_profiler for per-line timings. Use sampling profilers (py-spy) for low overhead in production.
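A minimal timeit sketch for a microbenchmark (the expression being timed is arbitrary):

```python
# timeit_example.py
import timeit

# number= sets iterations per run, repeat= how many runs; take the
# minimum as the most stable figure (least interference from the OS).
times = timeit.repeat("sum(range(1000))", number=1000, repeat=3)
best = min(times)
print(f"best of 3: {best:.6f}s for 1000 iterations")
```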
Memory Debugging: tracemalloc and memory_profiler
Find memory allocation hotspots.
Example with tracemalloc:
# tracemalloc_example.py
import tracemalloc

def allocate():
    a = [i for i in range(100000)]
    b = {i: str(i) for i in range(50000)}
    return a, b

tracemalloc.start()
snapshot1 = tracemalloc.take_snapshot()
allocate()
snapshot2 = tracemalloc.take_snapshot()
top_stats = snapshot2.compare_to(snapshot1, "lineno")
for stat in top_stats[:10]:
    print(stat)
Line-by-line:
- tracemalloc.start(): begin tracing allocations.
- take_snapshot(): capture memory state.
- compare_to: shows differences by line number.
- tracemalloc tracks Python memory allocations — C extensions may not be visible.
- For line-by-line memory usage, use memory_profiler (@profile decorator) with psutil installed.
Post-mortem Debugging and Crash Analysis
If the application crashes in production, you may want a post-mortem analysis.
- Use sys.excepthook to log unhandled exceptions with full tracebacks.
- Use faulthandler to dump Python tracebacks on fatal errors (faulthandler.enable()), after a timeout (faulthandler.dump_traceback_later), or on a signal such as SIGUSR1 (faulthandler.register).
- Configure core dumps via OS and analyze with gdb + python extensions for low-level crashes.
import faulthandler

faulthandler.enable()  # writes to stderr by default
Or redirect (keep the file object alive — faulthandler writes to it on a crash, so don't open it in a with block that closes it immediately):
f = open("faulthandler.log", "w")
faulthandler.enable(file=f)
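The sys.excepthook approach mentioned above can be sketched like this (the logger name is illustrative):

```python
# excepthook_example.py (illustrative)
import logging
import sys

logging.basicConfig(level=logging.ERROR)
logger = logging.getLogger("crash")

def log_unhandled(exc_type, exc_value, exc_traceback):
    # Let Ctrl-C keep its default behaviour.
    if issubclass(exc_type, KeyboardInterrupt):
        sys.__excepthook__(exc_type, exc_value, exc_traceback)
        return
    # exc_info= attaches the full traceback to the log record.
    logger.error("Unhandled exception",
                 exc_info=(exc_type, exc_value, exc_traceback))

sys.excepthook = log_unhandled
```

With this in place, any exception that escapes main() ends up in your logs with a full traceback instead of only on stderr.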
Debugging Real-World Scripts: Automating Repetitive Tasks
Many developers write scripts to automate file processing, ETL, or web scraping. Ensure those scripts are debuggable.
Example: CSV processing automation with logging and tests
# process_csv.py
import csv
import logging

logger = logging.getLogger(__name__)

def process_row(row):
    # Example transformation that may raise KeyError
    try:
        value = int(row["quantity"]) * float(row["price"])
    except KeyError as e:
        logger.error("Missing column: %s in row: %r", e, row)
        raise
    except ValueError:
        logger.warning("Invalid numeric value in row: %r", row)
        return None
    return {"sku": row.get("sku"), "total": value}

def process_file(path):
    results = []
    with open(path, newline="") as csvfile:
        reader = csv.DictReader(csvfile)
        for row in reader:
            r = process_row(row)
            if r:
                results.append(r)
    return results
Debugging strategy:
- Add a small unit test with a malformed CSV to reproduce.
- Use logging to show which rows fail.
- For large files, process a small sample to iterate faster.
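A sketch of such a test, with process_row shown inline so the example is self-contained (in a real project you would import it from process_csv):

```python
# test_process_csv.py
import logging
import pytest

logger = logging.getLogger(__name__)

# process_row copied from process_csv.py above for a self-contained example.
def process_row(row):
    try:
        value = int(row["quantity"]) * float(row["price"])
    except KeyError as e:
        logger.error("Missing column: %s in row: %r", e, row)
        raise
    except ValueError:
        logger.warning("Invalid numeric value in row: %r", row)
        return None
    return {"sku": row.get("sku"), "total": value}

def test_missing_column_raises():
    with pytest.raises(KeyError):
        process_row({"sku": "A1", "price": "2.5"})  # "quantity" absent

def test_invalid_number_returns_none():
    assert process_row({"sku": "A1", "quantity": "oops", "price": "2.5"}) is None
```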
Debugging Interactive Dashboards with Plotly
When creating dashboards (Plotly Dash or Plotly + Flask), bugs can live in server code, client-side scripts, or network layers.
Common issues:
- Callbacks not firing (Dash) because IDs don't match.
- Race conditions between data refresh and rendering.
- Slow queries causing UI timeouts.
A systematic approach:
- Reproduce the issue with a minimal app.
- Log server-side events (incoming requests, callback triggers).
- Use browser dev tools to inspect network requests and JS errors.
- Isolate data pipeline: test the API endpoint that supplies chart data.
# plotly_api.py (simplified)
from flask import Flask, jsonify
import logging

app = Flask(__name__)
logging.basicConfig(level=logging.DEBUG)

@app.route("/data")
def data():
    logging.debug("data endpoint called")
    # Imagine a heavy DB call here
    sample = {"x": [1, 2, 3], "y": [4, 5, 6]}
    return jsonify(sample)
Debugging tips:
- Test the endpoint with curl or httpie to confirm payload.
- If using async background tasks to refresh data (e.g., websockets), ensure tasks are awaited and errors are handled.
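To sanity-check the payload programmatically, a stdlib-only sketch can spin up a stand-in server and fetch /data, mimicking the check you would do with curl against the Flask endpoint above (the handler and payload are illustrative):

```python
# check_endpoint.py (illustrative, stdlib only)
import json
import threading
import urllib.request
from http.server import BaseHTTPRequestHandler, HTTPServer

class DataHandler(BaseHTTPRequestHandler):
    def do_GET(self):
        # Serve the same shape of payload as the Flask /data endpoint.
        body = json.dumps({"x": [1, 2, 3], "y": [4, 5, 6]}).encode()
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, *args):
        pass  # silence per-request logging in the demo

server = HTTPServer(("127.0.0.1", 0), DataHandler)  # port 0 = any free port
threading.Thread(target=server.serve_forever, daemon=True).start()

url = f"http://127.0.0.1:{server.server_port}/data"
with urllib.request.urlopen(url) as resp:
    payload = json.load(resp)
print("payload:", payload)
server.shutdown()
```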
Common Pitfalls and How to Avoid Them
- Swallowing exceptions (bare except): logs vanish. Use specific exceptions, and log exceptions with stack traces.
- Mutating shared state without synchronization in concurrent code: use locks or prefer immutable data.
- Ignoring warnings: treat DeprecationWarning and ResourceWarning seriously during development.
- Not testing edge cases: include tests for invalid inputs, empties, boundary values.
- Over-optimizing before profiling: Always profile before micro-optimizing.
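For the first pitfall, a small sketch of catching a specific exception and logging the traceback, instead of a bare except that silently swallows everything (parse_port is illustrative):

```python
# exceptions_pitfall.py (illustrative)
import logging

logging.basicConfig(level=logging.INFO)
logger = logging.getLogger(__name__)

def parse_port(value):
    # Bad: a bare `except:` would also swallow KeyboardInterrupt and
    # SystemExit, and the traceback would vanish. Catch the specific
    # error and use logger.exception to keep the stack trace.
    try:
        return int(value)
    except ValueError:
        logger.exception("invalid port value: %r", value)
        return None
```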
Advanced Tips
- Use type hints + mypy to find mismatched types before runtime.
- Enable stricter linters (flake8, pylint) to catch style and potential bugs.
- For production, use sampling profilers (py-spy) to inspect live systems with low overhead.
- Use structured logging (JSON) for easier aggregation in ELK/Graylog.
- Attach a remote debugger (debugpy for VS Code) if you need to debug code running in a container or remote host.
# remote_debug.py
import debugpy

# Listen for the debugger on port 5678 (do not leave this open in production)
debugpy.listen(("0.0.0.0", 5678))
print("Waiting for debugger to attach...")
debugpy.wait_for_client()
debugpy.breakpoint()
# Your app logic here
Security: protect debug ports and avoid enabling this in production unless secured.
Putting It Together: Debugging Workflow Example
Scenario: An automated ETL script that feeds a Plotly dashboard sporadically stops updating. Steps:
- Reproduce locally with sample data.
- Add logging around ETL stages (extract, transform, load) with timestamps and IDs.
- Run the script under a profiler to detect long-running tasks.
- Use tracemalloc if memory spikes before failing.
- If ETL runs asynchronously (asyncio), ensure all tasks are awaited and check for unhandled exceptions in tasks.
- If dashboard doesn't reflect data, curl the API endpoint to verify payloads.
- Add unit tests covering sample failure modes and integrate with CI.
Resources and References
- Official Python docs: logging, pdb, asyncio, cProfile, tracemalloc, faulthandler
- pytest documentation for testing strategies
- Plotly / Dash docs for dashboard development and debugging
- py-spy, memory_profiler, line_profiler for performance analysis
- debugpy for remote debugging
Conclusion
Effective debugging is a blend of discipline, good tooling, and iterative experiments. Use structured logging instead of print, adopt interactive debuggers for live inspection, write unit tests to capture regressions, profile only when needed, and apply async-aware techniques when working with concurrency.
Next steps:
- Pick one bug in your current project. Reproduce it, add a test, and debug using at least one tool from this guide (logging, pdb, or a profiler).
- If you automate tasks or build dashboards, instrument them with structured logs and small tests.
Further Reading
- Python logging — official docs
- pdb and the built-in breakpoint() — official docs
- asyncio — official docs (async/await patterns)
- pytest — official documentation
- Plotly Dash documentation and examples