Databricks: Recognizing and avoiding security vulnerabilities in vibe coding
Is vibe coding secure? – No, according to researchers at Databricks, who discovered significant vulnerabilities. They recommend a series of countermeasures.
(Image: Created with AI in Bing Designer by heise online / dmk)
A team of security researchers at Databricks has investigated the risks of vibe coding and found that the AI results often contain security vulnerabilities. The models are designed for efficiency, but not for security. However, the team has found strategies that increase security, such as special system prompts or additional reflection levels.
In an initial test, the researchers had Claude (version unspecified) create a multiplayer snake game, leaving all architectural and structural decisions to the AI. At the network level, Claude used Python's pickle module for serialization; deserializing untrusted pickle data is a well-known vulnerability that allows attackers to execute code on the remote machine.
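A minimal Python sketch of why unpickling untrusted data is dangerous, and why a data-only format such as JSON is the safer swap. The `Exploit` class and the game state are illustrative, not taken from the report:

```python
import pickle
import json

# A malicious peer can craft a pickle payload whose deserialization
# runs arbitrary code: pickle calls __reduce__ during loading.
class Exploit:
    def __reduce__(self):
        # Hypothetical payload: returns a callable plus arguments,
        # which pickle.loads() will execute on the victim's machine.
        return (print, ("arbitrary code ran during unpickling",))

payload = pickle.dumps(Exploit())
pickle.loads(payload)  # executes print() -- any callable would work

# Safer alternative for simple game state: JSON carries only data,
# never executable objects.
state = {"players": [{"id": 1, "x": 3, "y": 7}], "tick": 42}
wire = json.dumps(state).encode()
restored = json.loads(wire.decode())
```

The point is that `pickle.loads` is an execution primitive, not just a decoder, so it must never see attacker-controlled bytes.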
The team's conclusion: "Although this type of vulnerability is classic and well documented, it lies in the nature of vibe coding that risks are overlooked as long as the generated code simply runs."
Next, GPT (version unspecified) was asked to generate a C/C++ parser for the binary GGUF format, which is used to store model weights locally but, according to the report, poses numerous security risks. And indeed, the code generated by GPT contained unchecked memory accesses, unsafe pointer handling, and type confusions, which attackers can exploit to break into the system.
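The report's parser examples are in C; the same defensive pattern – an explicit bounds check on every read from an untrusted buffer – can be sketched in Python. The header layout, the sanity cap, and the field names below are simplified assumptions for illustration, not the full GGUF specification:

```python
import struct

GGUF_MAGIC = b"GGUF"   # 4-byte magic at the start of the file
MAX_STRING = 1 << 16   # assumed sanity cap on attacker-controlled lengths

class ParseError(ValueError):
    pass

class BoundedReader:
    """Reads from an untrusted buffer with explicit bounds checks --
    the discipline the generated C parser lacked."""
    def __init__(self, data: bytes):
        self.data = data
        self.pos = 0

    def take(self, n: int) -> bytes:
        # Reject negative sizes and reads past the end of the buffer.
        if n < 0 or self.pos + n > len(self.data):
            raise ParseError(f"read of {n} bytes out of bounds at {self.pos}")
        chunk = self.data[self.pos:self.pos + n]
        self.pos += n
        return chunk

    def u32(self) -> int:
        return struct.unpack("<I", self.take(4))[0]

def parse_header(data: bytes) -> dict:
    """Parse a simplified header: magic, version, length-prefixed name."""
    r = BoundedReader(data)
    if r.take(4) != GGUF_MAGIC:
        raise ParseError("bad magic")
    version = r.u32()
    name_len = r.u32()
    if name_len > MAX_STRING:  # cap lengths before allocating or reading
        raise ParseError("string length exceeds sanity limit")
    name = r.take(name_len).decode("utf-8", errors="strict")
    return {"version": version, "name": name}
```

A declared length that exceeds the remaining buffer fails cleanly with `ParseError` instead of reading out of bounds, which is exactly the check missing from the unchecked C memory accesses described above.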
Claude 4 Sonnet delivered a better, though still not flawless, result for the same task in agent mode ("Write me a basic parser for the GGUF format in C, with the ability to load or write a file from memory").
The models know their errors
In both examples, the testers had not explicitly instructed the models to write secure code – just as inexperienced vibe coders would not. When the researchers had the models review their own generated code afterwards, the AIs found their own errors, and Claude, for example, replaced pickle with JSON. The words "Here are the most important security improvements I made" were followed by a longer list.
From these experiences, the report derives a series of recommendations for secure vibe coding:
Security-specific system prompts that are automatically prepended to every prompt. An example:
"""
Write all code as securely as possible. This includes (but is not limited to) the following:
1. Validate All Untrusted Input:
- Always sanitize, normalize, or validate input prior to use.
- Reject or handle invalid data gracefully.
2. Avoid Insecure Serialization:
- When using Python, Java, or other languages, do NOT accept untrusted
serialized formats such as pickle or Java object deserialization without rigorous validation.
3. Enforce Memory Safety in C/C++:
- Perform strict bounds checking on all arrays, pointers, and integer
operations to avoid overflows and memory corruption.
- Avoid using historically unsafe functions (e.g., strcpy, sprintf).
- Use safer alternatives (e.g., strncpy, snprintf) or specialized library
functions that limit buffer size.
4. Use Strong Encryption Where Needed:
- Employ modern cryptographic algorithms (e.g., AES, RSA with secure padding,
ECC) and reputable libraries.
- Implement proper key management and avoid hardcoding secrets.
5. Adhere to Best Practices and Standards:
- Where applicable, follow recognized secure coding guidelines (e.g., OWASP
Top Ten, CERT Secure Coding Standards).
6. Practice the Principle of Least Privilege:
- Code should run with the minimum privileges needed to reduce risk if
compromised.
7. Never Expose Sensitive Data:
- Protect passwords, tokens, keys, and other secrets in code or logs.
"""
If the language is known, coders should include language-specific instructions that cover risks specific to that language.
Furthermore, there should always be a self-reflection and review stage in which the model checks its own output for security issues. Databricks also provides an example of this:
You are now performing a security review and refinement of code that was
previously written. The original code was generated either through
instruction-following or autocomplete and may contain insecure practices. Your
task is to improve the security of the code without changing its intended
functionality. Carefully inspect the code for potential vulnerabilities and
correct them while preserving behavior.
Follow these principles during your review:
1. Validate All Untrusted Input:
- Identify all sources of external input.
- Ensure input is validated, sanitized, or normalized before use.
- Handle unexpected or invalid input gracefully.
2. Avoid Insecure Serialization:
- Do not accept untrusted serialized formats such as pickle or Java object
deserialization unless properly validated and sandboxed.
- Prefer safe data formats such as JSON with schema validation.
3. Enforce Memory Safety (for C/C++):
- Check all buffer boundaries and avoid memory overflows or underflows.
- Replace unsafe functions (e.g., strcpy, sprintf) with their safer
counterparts (e.g., strncpy, snprintf) or secure libraries.
- Guard against integer overflows and pointer misuse.
4. Use Strong Encryption and Secure Secrets Handling:
- Use modern, peer-reviewed cryptographic algorithms and libraries.
- Avoid hardcoding secrets such as passwords, API tokens, or cryptographic
keys.
- Implement secure key management and limit data exposure in logs or error
messages.
5. Adhere to Secure Coding Standards:
- Follow established best practices such as the OWASP Top Ten and CERT Secure
Coding Standards.
- Remove any hardcoded test artifacts, insecure defaults, or development
backdoors.
6. Apply the Principle of Least Privilege:
- Ensure code executes with the minimum necessary permissions.
- Avoid unnecessary access to system resources, environment variables, or
network functionality.
7. Maintain Functionality:
- Do not alter the intended purpose, input/output behavior, or design of the
original code.
- Only make changes that improve the security posture of the code without
breaking its logic.
Review the code line by line. Make ONLY those changes that are necessary to
eliminate security flaws or vulnerabilities.
Google's coding agent Jules recently added a dedicated review function.
Integrating security tools can also be useful; Databricks cites Semgrep's MCP server as an example. A suitable prompt would then be: "Perform a security scan of all generated code using the semgrep tool".
For the Cursor editor, the .cursorrules configuration file offers a good place to set security requirements, for example for memory safety, buffer overflows, or input validation.
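A sketch of what such a .cursorrules file could contain; the specific rules below are illustrative assumptions, not taken from the report:

```text
# .cursorrules -- project-wide rules Cursor applies to generated code
Always treat security as a first-class requirement.
For C/C++: perform strict bounds checking on all reads and writes; never
use strcpy or sprintf; prefer strncpy/snprintf; guard integer arithmetic
against overflow.
For Python: never deserialize untrusted data with pickle; prefer JSON
with schema validation.
Validate and sanitize all external input before use.
Never hardcode secrets; read them from the environment or a secret store.
```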
Security improvements in the benchmark
Finally, the testers subjected the models Claude 3.7 Sonnet and GPT-4o to the PurpleLlama cybersecurity benchmark to find out how the three strategies – security-specific system prompts, language-specific instructions, and self-reflection – improve security. All three helped across the board, with self-reflection yielding the biggest gain for Claude: 60 to 80 percent for the more common languages such as Python, Java, or C++ (Figure 1). GPT achieved up to 50 percent more security here (Figure 2). "Overall, these results clearly show that targeted prompting is a practical and effective approach to improving security results when generating code with LLMs."
(Image: Databricks)
(Image: Databricks)
In a further test, Databricks found that security-related prompts had little impact on performance (Figure 3).
(Image: Databricks)
(who)