
# Python API

Agentic Flink offers two Python paths, each suited to a different use case. Pick one — they don't compose, and mixing them in one process is not supported.

| Path | Module | Use when | Mechanism |
| --- | --- | --- | --- |
| PyFlink-native (recommended for streaming) | `agentic_flink.pyflink` | You're shipping a real PyFlink job (`flink run -py …`) | Python builds an `AgentPlan` (JSON) → Java `CompileUtils.attachAgent` wires the agent operator into the job graph → PEMJA invokes Python callbacks on the operator thread |
| JPype standalone | `agentic_flink` (top-level) | Notebooks, scripts, services with no PyFlink dep | Boots a JVM in the Python process; calls Java through JNI |

The Java framework is the single source of truth for both paths. The sections below cover the JPype path; for PyFlink-native, see `pyflink-integration.md`.


## JPype standalone path

`agentic-flink` (imported as top-level `agentic_flink`) is a thin JPype-backed facade over the Java framework. The JVM runs in-process (thread mode); Python and Java share threads, and calls cross JNI without serialization. The Python package adds Pythonic ergonomics on top.

### Install

```shell
pip install agentic-flink
```

The package depends only on `JPype1`. PyFlink users can opt into the `pyflink` extra:

```shell
pip install agentic-flink[pyflink]
```

You also need the framework jar somewhere the bootstrap can find it. Discovery order:

  1. AGENTIC_FLINK_JAR environment variable.
  2. jar_path= kwarg to start_jvm.
  3. Sibling Maven build (../target/agentic-flink-*.jar) — covers editable installs from a checkout.
  4. Bundled package data (when shipping a wheel that includes the jar).
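As an illustration, the discovery order above could be sketched in pure Python. This is a hypothetical reconstruction, not the package's actual implementation; the function name `find_jar` and the bundled-data fallback (omitted here) are assumptions:

```python
import glob
import os
from pathlib import Path

def find_jar(jar_path=None, package_dir=Path(__file__).resolve().parent):
    """Illustrative sketch of the documented jar discovery order."""
    # 1. AGENTIC_FLINK_JAR environment variable wins.
    env = os.environ.get("AGENTIC_FLINK_JAR")
    if env:
        return env
    # 2. Explicit jar_path= kwarg to start_jvm.
    if jar_path:
        return str(jar_path)
    # 3. Sibling Maven build — covers editable installs from a checkout.
    hits = glob.glob(str(package_dir.parent / "target" / "agentic-flink-*.jar"))
    if hits:
        return hits[0]
    # 4. Bundled package data would be checked here (omitted in this sketch).
    raise FileNotFoundError("agentic-flink jar not found")
```

The error at the end matches the `FileNotFoundError` described in Troubleshooting below.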

Build the jar yourself with `mvn -DskipTests package` from the repo root.

### Quick start

```python
from datetime import timedelta
import agentic_flink as af
from agentic_flink import Agent, ChatSetup, langchain4j_ollama, tool

af.start_jvm()

@tool(name="add", description="Add two integers")
def add(a: int, b: int) -> int:
    return a + b

agent = (
    Agent.builder()
        .with_id("calc-bot")
        .with_system_prompt("You are a calculator.")
        .with_chat_connection(langchain4j_ollama())
        .with_chat_setup(ChatSetup(model="qwen2.5:3b", temperature=0.3))
        .with_tools(add)
        .with_max_iterations(5)
        .with_short_term_ttl(timedelta(minutes=30))
        .build()
)

print(agent)
# Invoke the tool through its Java ToolExecutor proxy — same path the
# agent operator uses at runtime.
HashMap = af.jclass("java.util.HashMap")
args = HashMap(); args.put("a", 2); args.put("b", 3)
print("2 + 3 =", add._to_java().execute(args).get())
```

### The `@tool` decorator

Decorating a Python function makes it callable as a Java com.ververica.flink.agent.tools.ToolExecutor. The decorator generates a JPype proxy that converts Java Map<String, Object> parameters to Python kwargs (matched by parameter name), invokes the function, and wraps the return value in a CompletableFuture.

```python
@tool
def greet(name: str) -> str:
    """Friendly greeting."""
    return f"hi {name}"

@tool(name="weather", description="Look up today's weather for a city")
def weather(city: str) -> dict:
    return {"city": city, "temp_c": 18.0, "summary": "cloudy"}
```

The Python function runs on whatever JVM thread invoked it (typically the agent operator's `processElement` thread). No serialization, no process boundary, no extra deps.

If the function raises, the exception flows back as CompletableFuture.failedFuture(...); the agent operator's normal error-handling kicks in.
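Both paths can be illustrated with a small pure-Python sketch, using `concurrent.futures.Future` as a stand-in for Java's `CompletableFuture`. The `execute_tool` name is hypothetical, not part of the package's API:

```python
import concurrent.futures

def execute_tool(fn, java_map):
    # Stand-in for the generated proxy's execute(Map) method: convert the
    # Map<String, Object> entries to kwargs by parameter name, then either
    # complete the future with the result or fail it with the exception.
    fut = concurrent.futures.Future()
    try:
        fut.set_result(fn(**dict(java_map)))
    except Exception as exc:
        fut.set_exception(exc)  # mirrors CompletableFuture.failedFuture(exc)
    return fut

def add(a: int, b: int) -> int:
    return a + b

result = execute_tool(add, {"a": 2, "b": 3}).result()  # → 5
```

A raising function completes the future exceptionally instead of propagating into the caller, which is what lets the agent operator's error handling take over.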

### Listeners (and other interfaces) in Python

Subclass PyAgentEventListener and override the hooks you care about:

```python
from agentic_flink.listener import PyAgentEventListener

class StdoutListener(PyAgentEventListener):
    name = "stdout-listener"
    def on_chat_request(self, agent_id, model, message_count):
        print(f"{agent_id}: requesting {model} with {message_count} messages")
    def on_tool_call_end(self, agent_id, tool, call_id, success, duration_ms):
        print(f"{agent_id}: {tool} {'ok' if success else 'failed'} in {duration_ms}ms")

agent = Agent.builder().with_listener(StdoutListener()).build()
```

The same pattern works for `Guardrail`, `Classifier`, `Scorer`, and `Chunker` implementations — under the hood, JPype's `@JImplements` decorator generates the proxy.
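To make the hook pattern concrete, here is a pure-Python sketch of the dispatch. The base class below is a stand-in for `PyAgentEventListener` (which provides no-op defaults, so subclasses override only the hooks they care about); hook names come from the example above:

```python
class AgentEventListenerSketch:
    # Stand-in base class: no-op defaults for every hook.
    def on_chat_request(self, agent_id, model, message_count):
        pass
    def on_tool_call_end(self, agent_id, tool, call_id, success, duration_ms):
        pass

class CollectingListener(AgentEventListenerSketch):
    """Overrides a single hook; the rest fall through to the no-op base."""
    def __init__(self):
        self.events = []
    def on_tool_call_end(self, agent_id, tool, call_id, success, duration_ms):
        self.events.append((tool, success))

listener = CollectingListener()
# Simulate the calls the Java side makes as the agent runs:
listener.on_chat_request("calc-bot", "qwen2.5:3b", 2)        # ignored (no-op)
listener.on_tool_call_end("calc-bot", "add", "c1", True, 12.5)  # recorded
```

In the real package, the JPype proxy forwards each Java-side event to the matching Python method in exactly this shape.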

### PyFlink integration

JPype's JVM is the same JVM PyFlink runs against. Use the package's wrappers to build the spec objects, then hand them to ordinary PyFlink operators:

```python
from pyflink.datastream import StreamExecutionEnvironment

env = StreamExecutionEnvironment.get_execution_environment()
# ... pull the underlying StreamExecutionEnvironment Java object, pass to
# CrawlerCore.builder().frontier(...).open(env_java), etc.
```

See `agentic_flink.examples.live_research` for a full PyFlink-driven job graph that wires the framework's `CrawlerCore` + `IngestionPipeline` + `RetrievalPipeline` together.

### What's wrapped

| Module | Surface |
| --- | --- |
| `agentic_flink.agent` | `Agent`, `AgentBuilder`, default minimal state machine |
| `agentic_flink.llm` | `ChatSetup`, `ChatMessage`, `langchain4j_ollama`, `langchain4j_openai`, `chat()` |
| `agentic_flink.tools` | `@tool` decorator, `PythonTool` |
| `agentic_flink.memory` | `flink_state_short_term`, `flink_state_brute_force`, `flink_state_hnsw` |
| `agentic_flink.embedding` | `EmbeddingSetup`, `ollama_embedding`, `djl_embedding` |
| `agentic_flink.corpus` | `single_operator`, `broadcast`, `external` |
| `agentic_flink.channel` | `static_seed`, `kafka`, `kafka_context`, `webhook`, `tool_invocation_side_output`, `tool_invocation_in_jvm` |
| `agentic_flink.inference` | `InferenceSetup`, `djl_classification`, `djl_embedding`, `classifier_guardrail`, `inference_tool` |
| `agentic_flink.web` | `options`, `fetch_tool`, `extract_links_tool`, `url_request`, `crawler_core` |
| `agentic_flink.ingest` | `recursive_chunker`, `chunk()`, `pipeline_from` |
| `agentic_flink.retrieve` | `pipeline_from` |
| `agentic_flink.listener` | `PyAgentEventListener` |
| `agentic_flink.skill` | `Skill`, `mcp_stdio`, `mcp_http` |

Every wrapper exposes `_to_java()` to drop down to the live Java object.
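The wrapper shape itself is simple. A minimal pure-Python sketch of the pattern (with a dummy object standing in for the live JPype proxy; the class names here are illustrative, not the package's):

```python
class JavaBackedSketch:
    """Pythonic surface over a live Java object (illustrative only)."""
    def __init__(self, java_obj):
        self._java = java_obj

    def _to_java(self):
        # Escape hatch: hand back the underlying Java object for
        # APIs the Python wrapper doesn't cover.
        return self._java

class FakeJavaAgent:
    # Stand-in for a JPype proxy to a Java Agent instance.
    def getId(self):
        return "calc-bot"

agent = JavaBackedSketch(FakeJavaAgent())
agent_id = agent._to_java().getId()  # → "calc-bot"
```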

### Examples

Three runnable scripts under `agentic_flink.examples`:

- `quickstart` — calculator tool + agent build, no LLM call.
- `rag` — sequential Python RAG with sentence-transformer embeddings.
- `live_research` — full PyFlink job with two-input crawler + retrieve.

```shell
python -m agentic_flink.examples.quickstart
python -m agentic_flink.examples.rag
python -m agentic_flink.examples.live_research
```

### Testing

The package's pytest suite assumes a built framework jar. From the repo root:

```shell
mvn -DskipTests package
pip install -e python[test]
pytest python/tests/
```

The shared `conftest.py` boots the JVM once per session and discovers the framework + runtime jars via Maven.

### Troubleshooting

- `FileNotFoundError: agentic-flink jar not found` — set `AGENTIC_FLINK_JAR` or run `mvn -DskipTests package`.
- `NoClassDefFoundError: org/slf4j/LoggerFactory` — the shaded jar doesn't include `<scope>provided</scope>` Flink deps. Pass the full runtime classpath via `extra_jars=` to `start_jvm`. Generate it with `mvn dependency:build-classpath -Dmdep.outputFile=/tmp/cp.txt`.
- `Initial state has no outgoing transitions` — the Python builder supplies a permissive default state machine; if you call `.with_state_machine(...)`, make sure it covers every non-terminal `AgentState`.
- `TypeError: No matching overloads found` — JPype is strict about Java boxed types (`Long`, `Integer`, etc.). The wrappers handle the common cases; for new bindings, explicitly box with `af.jclass("java.lang.Long")(int(x))`.