# marvin-ai
n
Hi! New to controlflow. There is lots of great documentation to read and lots to learn. For fun I thought I'd try to clone NotebookLM deep dive's script generation. I'm not getting the output I hoped for--with an error about half the time and a short back-and-forth otherwise. Is it a bad practice to pass very long context? What should I change in my prompt to avoid "openai.APIError: The model produced invalid content. Consider modifying your prompt if you are seeing this error persistently."
Here's some example output from when there isn't an error. I was surprised to see a turn repeated.
I thought it might be helpful to see the API traces. It wasn't obvious to me how to turn on logging (skill issue šŸ™ƒ) but I did eventually arrive at the following to see what's going on:
import logging
logging.getLogger("controlflow").setLevel(logging.DEBUG)
logging.getLogger("openai").setLevel(logging.DEBUG)
I was also pleased that langtrace worked with no hiccups and gave a nice UI, via this:
from langtrace_python_sdk import langtrace
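# `config` here is just the keyword arguments for langtrace.init; a hypothetical
# minimal example (the exact options depend on your langtrace_python_sdk version):
config = {"api_key": "your-langtrace-api-key"}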
langtrace.init(**config)
Anyway, I'm humbled -- I thought I'd pick up this agentic stuff fast šŸ˜‚
j
Hey @Nat Taylor! This is really cool and I love the idea
n
Thanks! I feel the same way about controlflow 😊
j
šŸ™‚
Quick notes:
• controlflow.settings.log_level = 'DEBUG' will be slightly cleaner than going via the logging module (see the sketch just below)
• that OpenAI error is EXTREMELY frustrating -- there is some bug on the API side where the model attempts to generate an invalid tool call. We have been unable to figure out how or why it happens, though we have been able to demonstrate it is deterministic by recreating it with the OpenAI native library (no ControlFlow). Because of that, and the fact that it's on the API side, we haven't been able to find a way around it. Sorry you hit it, I wish I had a more constructive piece of advice. As you iterate it will hopefully just naturally disappear.
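A minimal sketch of that settings-based approach (assuming a ControlFlow version where settings.log_level is exposed):

import controlflow

# equivalent to the logging-module approach above, but via ControlFlow's own settings
controlflow.settings.log_level = "DEBUG"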
Now on to the meat of your questions -- from a strictly "framework" point of view, CF is doing what you asked it to do, but here are a couple of ways you can get better results
n
BRB
j
The double "turn" is a little tricky - Bill is posting a message to the internal thread, and then all other conversation is happening as the agents pass the virtual mic back and forth. From a technical perspective, Bill's initial message and the subsequent delegation to Hillary are both part of the same "turn", as a turn is defined as any single agent being invoked one or more times. The reason a turn and an invocation aren't the same is that when an agent uses a tool, it frequently needs to be shown the tool result in a subsequent call. So both of those LLM invocations would constitute a single "turn". On the main branch I've actually been adding a utility that prevents agents from positing messages like that since it sometimes leads to redundant outcomes like this! Should be in a release sometime in the next few days.
You might get better results if you split up your single task into many tasks. The agents are nominally complying with your instructions, but it's all very implicit. Something like:
@cf.flow
def podcast():
    lines = []
    while True:
        # each agent contributes the next line in turn
        next_line = bill.run("generate the next line in the podcast")
        lines.append(next_line)
        next_line = hillary.run("generate the next line in the podcast")
        lines.append(next_line)

        # break somehow -- e.g. after a fixed number of lines, or via a third task
        if len(lines) >= 20:
            break
    return '\n\n'.join(lines)
In this setup, it's much more explicit that you want the agents to generate a line, though you give up the more natural yielding to each other via the delegation tool. However, it might improve your ability to collect and introspect the outcome.
The other advantage is that you can determine when to break the loop yourself, or with a third task; in your single-task setup, the task is going to end the second the agents believe they have satisfied your objective, which may be too short. This is one of those fuzzy "we're all learning agent best practices" zones - I'm not sure which approach will get better outcomes, but I wanted to show you different ways of thinking about how to engage agents. Your approach is a one-shot ask ("hey agents, please do this") while the explicit loop takes much tighter control of the situation. There are many approaches in between!
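For instance, here's a rough sketch of the "third task" idea (assuming bill and hillary are your agents from above, and that cf.run accepts result_type and context as in recent ControlFlow releases):

import controlflow as cf

@cf.flow
def podcast(max_lines: int = 20):
    lines = []
    while True:
        lines.append(bill.run("generate the next line in the podcast"))
        lines.append(hillary.run("generate the next line in the podcast"))

        # a third task judges whether the script feels finished,
        # rather than relying on a hard-coded line count alone
        done = cf.run(
            "Has this podcast script reached a natural ending?",
            result_type=bool,
            context={"script": "\n\n".join(lines)},
        )
        if done or len(lines) >= max_lines:
            break
    return "\n\n".join(lines)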
By the way, if you want the agents to go back and forth instead of delegating to each other, this is a relevant example in the docs
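Not the docs example itself, but the general shape there is a single shared task with both agents attached, so the orchestrator alternates between them rather than relying on delegation tools -- roughly:

# assumes `import controlflow as cf` and the bill/hillary agents defined earlier
script = cf.run(
    "Write the podcast script as a back-and-forth dialogue between the two hosts",
    agents=[bill, hillary],
    result_type=str,
)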
Oh, and it is not bad practice to pass long context, though now I am wondering if the fact that the context is shown to the agent after the task definition could affect performance. I don't think it should.
d
OK, so in my experience this error often happens when you try to send a very long, repetitive string into the model -- aka one word over and over again.
n
The text I am using has some repetitions. I will try removing those.
...it...it...it...it...it...it...
What kind of subjects do you remember? I remember. I remember. I remember. I remember. I remember. I remember. I remember. I remember. I remember. I remember. I remember. I remember. I remember. I remember. I remember. I remember. I remember. I remember. I remember. I remember. I remember. I remember. I remember. I remember. I remember. I remember. I remember.
I was like, oh, I'm sorry. I was like, oh, I'm sorry. I was like, oh, I'm sorry. I was like, oh, I'm sorry. I was like, oh, I'm sorry. I was like, oh, I'm sorry. I was like, oh, I'm sorry. I was like, oh, I'm sorry.
They were for women. And they were for women. And they were for women. And they were for women. And they were for women. And they were for women. And they were for women. And they were for women. And they were for women. And they were for women. And they were for women. And they were for women. And they were for women. And they were for women.
This text is the output of mlx-whisper, so maybe it's a strange feedback cycle where the model gets tripped up during transcription and then tripped up again by the repeated input tokens (although I don't have any hypotheses on what "tripped up" really means).
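In case it helps anyone else, here's a blunt little sketch for collapsing back-to-back repeats before passing the transcript in as context (just to test whether the repetition is what trips up the model):

import re

def collapse_repeats(text: str) -> str:
    # Collapse a short phrase repeated back-to-back (e.g. "I remember. I remember. ...")
    # down to a single occurrence. Crude, but enough for an experiment.
    pattern = re.compile(r"(\b.{3,80}?)(?:\s*\1)+", re.DOTALL)
    return pattern.sub(r"\1", text)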
I will try all these suggestions. Thank you!
On "openai.APIError: The model produced invalid content. Consider modifying your prompt if you are seeing this error persistently." -- at least in my case it was often (always?) the result of an EMPTY response from the API, so I followed this thread: https://community.openai.com/t/empty-text-in-the-response-from-the-api-after-few-calls/2067/4 which says a space/newline at the end of the prompt can cause issues. So I added the string "Please follow the instructions." to the end of llm_instructions.jinja as a lazy way to avoid trailing newlines, and I haven't hit the error since. Maybe there's an opportunity to add a strip() around the prompt as an experiment?
Looping is producing much better results - thank you!
j
@Nat Taylor sorry for not seeing this -- if you've solved the mystery of that OpenAI error message that would be INCREDIBLE
d
I ran into it today too while trying gpt-4o-mini
So if you want to repro it, that might be an easy way
n
šŸ˜• my suggestion may help but it's not the antidote
d
We really want controlflow to do managed retries though, right?
j
Hmm, I just ran into this today. And I can't reproduce it. Did anyone ever figure this out?
d
This is something that has always happened with the OpenAI models (aka not just controlflow), and it's something that you can really only handle by catching the exception and retrying.
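Something like this rough sketch, where run_my_task is a placeholder for whatever task or flow you're invoking:

import openai

def run_with_retries(run_my_task, max_attempts: int = 3):
    # Catch the API-side "invalid content" failure and simply try again;
    # not a fix, just a way to keep the flow moving.
    for attempt in range(1, max_attempts + 1):
        try:
            return run_my_task()
        except openai.APIError as exc:
            if attempt == max_attempts:
                raise
            print(f"Attempt {attempt} failed ({exc}); retrying...")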
j
@Jason (or anyone in this thread) - do you have any example of the code that produced it? My code that used to cause the error is working now so I'm having trouble testing a solution
j
It's just running a task within a cf.Flow block. I presumed maybe it has to do with my prompt formatting(?), but that wouldn't necessarily explain the rarity of it.
d
Switch to gpt-4o-mini and it will probably be easy to repro
j
@Jason is there any way to share what your code is, or a stripped-down version? It has something to do with formatting rather than complexity. I used to have an example that was literally a single one-line task that I could replicate as a raw call to the OpenAI API and get the error.
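Not that original example, but for anyone who wants to chase it: the raw-call shape is roughly the sketch below -- the model name, message, and tool schema are placeholders, so substitute the exact ones from your trace.

from openai import OpenAI

client = OpenAI()

# Placeholder repro shape: replay the exact messages and tool schemas from your
# ControlFlow/langtrace trace directly against the API and see if the error recurs.
response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "<the exact prompt from your trace>"}],
    tools=[
        {
            "type": "function",
            "function": {
                "name": "example_tool",
                "description": "Substitute the real tool schema from your trace.",
                "parameters": {"type": "object", "properties": {}},
            },
        }
    ],
)
print(response.choices[0].message)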