Code injection during object deserialization

python.code_injection_deserialization

Severity

critical

Resource

Injection

Language

Python

Description

Improper deserialization of untrusted data, possibly allowing code injection attacks.

Deserialization vulnerabilities arise when an application deserializes user-controlled data without proper validation, potentially allowing attackers to instantiate unexpected objects or execute crafted functions leading to arbitrary code execution or system exploits.

Rationale

After detecting the vulnerable site, attackers typically craft a payload that is serialized and sent to the application. If the application deserializes this payload without validation, attackers can modify the expected data structure to escalate privileges or perform unwanted actions.

In the worst case, the attackers inject an object of an unexpected type, triggering chosen code execution. This could be used to exfiltrate internal server’s data, install malware e.g. to install and persist crypto-miners, or run a reverse shell.

Consider the following Python example:

import pickle
from flask import Flask, request

app = Flask(__name__)

@app.route('/deserialize')
def unsafe_deserialize():
    # Directly deserializing user input
    serialized_data = request.data
    data = pickle.loads(serialized_data)
    return 'Deserialization successful!'

In this example, the application directly deserializes data received in a request. With Python’s pickle, this could allow an attacker to craft data that executes arbitrary code on the server, leading to severe security implications.

Remediation

If possible, do not deserialize content to objects when the source is not fully trusted.

Mitigating deserialization vulnerabilities involves several key practices:

Avoid Deserialization of Untrusted Input: The most secure approach is to avoid deserializing any untrusted data outright. If deserialization is necessary, consider other safe data formats such as JSON or XML with strict schema validation.
Use Secure Libraries: Employ libraries or frameworks that provide secure deserialization mechanisms.
Validation and Whitelisting: Implement strict validation. Use a whitelist approach where only the expected, permitted, and safe classes are allowed to be deserialized.

import json
from flask import Flask, request

app = Flask(__name__)

@app.route('/deserialize')
def safe_deserialize():
    # Use safer methods like JSON for serialization
    try:
        serialized_data = request.data
        data = json.loads(serialized_data)
    except json.JSONDecodeError:
        return 'Invalid data!', 400

    return 'Deserialization successful!'

In this improved example, json.loads() is used instead of pickle.loads(). JSON serialization does not allow arbitrary code execution, making it a safer option for handling untrusted data. Additionally, input validation is employed to ensure robustness against malformed input.

Configuration

The detector has the following configurable parameters:

sources, that indicates the source kinds to check.
neutralizations, that indicates the neutralization kinds to check.

Unless you need to change the default behavior, you typically do not need to configure this detector.

References

CWE-502 : Deserialization of Untrusted Data.
OWASP - Top 10 2021 Category A08 : Software and Data Integrity Failures.
Deserialization Cheat Sheet - OWASP Cheat Sheet Series.
PortSwigger Insecure deserialization.