Blender is a powerful tool, but sometimes it's easier to use plain English to accomplish your task.
This project presents a Blender add-on that provides an interface to Llama 3 70b, allowing it to perform tasks in Blender 3D given plain-English instructions.
Video Demo:
How does it work?
TL;DR: Llama 3 70b (a powerful LLM that the AMD Radeon Pro W7900 is large enough to host while also handling 3D content creation) is configured to act as a Blender Python API assistant, and your commands are converted into API calls. Erroneous commands are caught, and the LLM (with a slightly different configuration) is re-prompted to fix them.
Blender is a free and open-source 3D content creation suite. Its interface is built on Python, meaning there exists a Python command for pretty much everything. A Blender add-on is simply a Python script that can invoke arbitrary Blender commands.
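For a concrete sense of what such commands look like, here are two standard bpy calls (shown purely as an illustration, not the add-on's internals):

import bpy

# Add a cube at the world origin.
bpy.ops.mesh.primitive_cube_add(location=(0.0, 0.0, 0.0))

# Move the newly added (active) object two units up along Z.
bpy.context.active_object.location.z += 2.0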
LLMs like Llama 3 70b have been trained on Blender's Python API and understand how to structure Python code very well; all that is left is a means of prompting and a device to perform inference quickly. That is where the AMD Radeon Pro W7900 comes in. This pro-level GPU has 48 GB of VRAM, which is enough to host Llama 3 70b (with minimal quantization) while also performing content creation in Blender.
To quickly set up and manage the LLM, Ollama was used. It supports ROCm GPUs and can efficiently manage the device's VRAM.
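Ollama exposes a small HTTP API on port 11434. As a rough sketch (the helper name and the "llama3:70b" model tag are assumptions here, not the add-on's actual code), a prompt can be sent using only Python's standard library, which matters since Blender's bundled Python may not ship extra packages:

import json
import urllib.request

def ask_ollama(prompt, system):
    # Build a non-streaming request to Ollama's /api/generate endpoint.
    payload = json.dumps({
        "model": "llama3:70b",
        "system": system,
        "prompt": prompt,
        "stream": False,
    }).encode("utf-8")
    req = urllib.request.Request(
        "http://localhost:11434/api/generate",
        data=payload,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]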
To help with building the Python commands, a bit of prompt engineering was put in. When handling a prompt, the model is told the context of what it is doing (Blender Python) and instructed to ignore unrelated tasks and answer only with code (no explanation). That way responses are faster and the commands can be easily parsed. Along with the user's prompt, a dictionary with the current objects in the scene and their locations is also supplied to the model as context.

Once the model generates the appropriate Python code, it is executed in Blender's interpreter and the command is shown in the UI for reference. If an error occurs while evaluating the command (since LLMs can sometimes make mistakes), a new prompt containing the Python traceback is generated and given to the LLM to fix in a retry loop. If the error can't be resolved after a few tries, it is shown to the user.
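Putting those pieces together, the flow described above can be sketched roughly as follows; the function and variable names are illustrative, not the add-on's actual identifiers:

import bpy
import traceback

SYSTEM_PROMPT = ("You are a Blender Python assistant. Answer only with Python "
                 "code that uses the bpy module; no explanations. Ignore any "
                 "request unrelated to Blender.")

def scene_context():
    # Dictionary of the current objects and their locations, sent as context.
    return {obj.name: tuple(obj.location) for obj in bpy.context.scene.objects}

def run_command(user_prompt, max_tries=3):
    prompt = f"Objects in scene: {scene_context()}\nTask: {user_prompt}"
    for _ in range(max_tries):
        code = ask_ollama(prompt, SYSTEM_PROMPT)  # helper sketched above
        try:
            exec(code, {"bpy": bpy})  # run in Blender's interpreter
            return code               # success: show the command in the UI
        except Exception:
            # Feed the traceback back so the model can repair its own code.
            prompt = (f"This code:\n{code}\nraised the error:\n"
                      f"{traceback.format_exc()}\nReturn the fixed code.")
    raise RuntimeError("Command failed after several repair attempts")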
Usage
This add-on was tested on Ubuntu 22.04.4 LTS with ROCm 6 installed. The add-on is not dependent on the OS or a specific ROCm version, though; if you can get ROCm installed and Ollama running with ROCm support, that's all that is needed.
1. Install Blender 3D
2. Install ROCm
3. Install and serve Ollama from http://localhost:11434. Using Docker is the easiest method.
4. From the code section or the GitHub repo, download the add-on script called ai_assistant.py.
5. Launch Blender. From Edit > Preferences > Add-ons > Install, select the downloaded script. Click the enable checkbox once installed.
6. Press "N" to open the sidebar. Select the "AI Assistant" tab to show the UI.
7. Type in a command like "Add a cube above the current one" and press Submit.
After some thinking, you should see a cube appear above the default one!
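For reference, the model's reply to that command would be something along these lines (the exact code varies from run to run); the default cube sits at the origin, so the new one lands above it:

import bpy
bpy.ops.mesh.primitive_cube_add(location=(0.0, 0.0, 2.0))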
If this is the first time Ollama and the Llama 3 70b model are being used, you may have to wait a while for Ollama to download and initialize the model. Downloads are cached, and the model will live in VRAM for quick access. After 5 minutes of inactivity it will be unloaded (Ollama's default).
Where to go from here?
All code is free and open-source and can be modified to your liking. For example, the model prompt settings are on lines 46 - 49. One may want to extend the time the LLM lives in VRAM. To do so, simply add the option keep_alive = -1 to keep the model loaded indefinitely (or = 30m for 30 minutes).
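keep_alive is a standard Ollama request option; assuming the add-on sends a request payload like the one sketched earlier, the change would look roughly like this:

payload = {
    "model": "llama3:70b",          # assumed model tag
    "prompt": "Add a cube above the current one",
    "stream": False,
    "keep_alive": -1,               # never unload; "30m" keeps it 30 minutes
}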
One future direction is to try to curate a training data set to fine-tune these models. However, Llama 3 70b is pretty good already.
Feedback is appreciated in the comment and issue board. Thank you!