ci: E2E integration test against latest OpenClaw by gidim · Pull Request #64 · comet-ml/opik-openclaw

gidim · 2026-04-10T18:43:21Z

Problem

Unit tests can't catch the class of bug fixed in #59 — hooks silently dropped due to OpenClaw plugin lifecycle changes. OpenClaw moves fast; we need a test that will break if a future OpenClaw release breaks the plugin again.

What this adds

A self-contained E2E workflow (.github/workflows/e2e.yml) that:

1. Installs the latest published OpenClaw (`npm install -g openclaw@latest`)
2. Builds the plugin from source and installs the tarball into OpenClaw
3. Starts a mock Opik server (`scripts/mock-opik-server.mjs`) that captures all trace/span API calls
4. Starts a mock LLM server (`scripts/mock-llm-server.mjs`) that is OpenAI-compatible and returns a canned response, so no real API key is needed
5. Runs a real gateway turn (`openclaw agent --message "ping"`)
6. Asserts that the mock Opik server received ≥1 trace batch and ≥1 span batch

If the regression fixed in #59 were still present, step 6 would catch it: zero traces would reach the mock server.
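To make the capture-then-assert idea concrete, here is a minimal sketch of how a mock Opik server could tally incoming batches. It is not the code in `scripts/mock-opik-server.mjs`: the function names, the `traceBatches`/`spanBatches` field names, and the `/traces/batch` and `/spans/batch` path suffixes are assumptions made for illustration.

```javascript
// Hypothetical sketch: count Opik batch calls so a later step can assert on them.
// Path suffixes and field names are assumptions, not taken from the real script.
function createCaptureState() {
  return { traceBatches: 0, spanBatches: 0, requests: [] };
}

// Record one HTTP request seen by the mock server.
function recordRequest(state, method, url, body) {
  const pathname = url.split("?")[0]; // ignore any query string
  state.requests.push({ method, pathname, body });
  if (method !== "POST") return state;
  if (pathname.endsWith("/traces/batch")) state.traceBatches += 1;
  else if (pathname.endsWith("/spans/batch")) state.spanBatches += 1;
  return state;
}

// Serialize the counters; a real server might write this string to e2e-result.json.
function toResultJson(state) {
  return JSON.stringify(
    { traceBatches: state.traceBatches, spanBatches: state.spanBatches },
    null,
    2
  );
}
```

An HTTP handler would call `recordRequest` for every request and persist `toResultJson(state)` on shutdown, so that a hook-lifecycle regression shows up as both counters staying at zero.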

No secrets required

Both LLM and Opik are mocked locally. The workflow is fully self-contained.
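For illustration, a canned OpenAI-compatible reply can be built without any network access or API key. The sketch below assumes the chat completions response layout and a reply text of `"pong"`; the helper names are hypothetical, and the actual `scripts/mock-llm-server.mjs` may differ (a later commit in this PR switches to an explicit mock responses provider).

```javascript
// Hypothetical helpers for a mock LLM server; not the PR's actual script.
// Field layout follows the OpenAI chat completions response format.
function cannedCompletion(model, text = "pong") {
  return {
    id: "chatcmpl-mock",
    object: "chat.completion",
    created: 0,
    model,
    choices: [
      {
        index: 0,
        message: { role: "assistant", content: text },
        finish_reason: "stop",
      },
    ],
    usage: { prompt_tokens: 1, completion_tokens: 1, total_tokens: 2 },
  };
}

// Streaming variant: server-sent events, one JSON chunk per "data:" line,
// terminated by the "[DONE]" sentinel.
function cannedStream(model, text = "pong") {
  const chunk = (delta, finishReason) => ({
    id: "chatcmpl-mock",
    object: "chat.completion.chunk",
    created: 0,
    model,
    choices: [{ index: 0, delta, finish_reason: finishReason }],
  });
  const events = [
    chunk({ role: "assistant", content: text }, null),
    chunk({}, "stop"),
  ];
  return events.map((e) => `data: ${JSON.stringify(e)}\n\n`).join("") + "data: [DONE]\n\n";
}
```

A server would return `cannedCompletion(...)` as JSON for non-streaming requests and write `cannedStream(...)` with a `text/event-stream` content type when the request sets `stream: true`.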

Matrix

The workflow runs against openclaw@latest today. The matrix can be extended to pin specific versions as OpenClaw ships new releases.

Files

| File | Purpose |
| --- | --- |
| `.github/workflows/e2e.yml` | CI workflow |
| `scripts/mock-opik-server.mjs` | Captures Opik API calls, writes `e2e-result.json` |
| `scripts/mock-llm-server.mjs` | OpenAI-compatible mock, supports streaming + non-streaming |
| `scripts/check-e2e-result.mjs` | Reads result file, exits non-zero if traces/spans missing |
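As a rough sketch of the check step, the logic below maps the contents of a result file to an exit code. The `traceBatches` and `spanBatches` field names and the specific exit codes are assumptions for illustration; the real `scripts/check-e2e-result.mjs` may use a different schema.

```javascript
// Hypothetical sketch of the result check; the field names are assumed.
function findFailures(result) {
  const failures = [];
  // A missing or non-numeric field fails the >= comparison and counts as a failure.
  if (!(result.traceBatches >= 1)) failures.push("no trace batch received");
  if (!(result.spanBatches >= 1)) failures.push("no span batch received");
  return failures;
}

// Returns 0 on success, 1 when traces or spans are missing,
// and 2 when the result file content is not a JSON object.
function exitCodeFor(resultJson) {
  let result;
  try {
    result = JSON.parse(resultJson);
  } catch {
    return 2;
  }
  if (result === null || typeof result !== "object") return 2;
  return findFailures(result).length === 0 ? 0 : 1;
}
```

A script could read `e2e-result.json` as text, pass it to `exitCodeFor`, and assign the value to `process.exitCode`, which makes the CI step fail whenever either batch type is absent.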

Installs OpenClaw fresh, builds the plugin from source, starts the gateway with a mock LLM + mock Opik server, runs a real agent turn, and asserts traces and spans were exported. Catches hook-lifecycle regressions that unit tests cannot detect. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

- gateway.auth must be an object: {"mode": "none"}
- models.providers.openai.apiUrl -> baseUrl
- models.providers.openai.models is required (array)
- remove unrecognized keys: models.defaults, agents.defaults.provider
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

gidim requested a review from a team as a code owner April 10, 2026 18:43

Gideon Mendels and others added 16 commits April 10, 2026 14:44

ci: run E2E daily to catch new OpenClaw releases breaking the plugin

9086c2c

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

ci: use Node 22.x in E2E workflow to match openclaw@latest requirement

77f8e00

openclaw latest now requires Node >=22.14.0; pinning to 22.12.0 broke the plugin install step. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

ci: pin npm to 11.6.2 in E2E workflow

fdcc50a

npm@latest self-upgrade breaks on Node 22.22.2 in GitHub Actions. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

ci: models array expects objects not strings

f528437

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

ci: model object requires name field not id

d299652

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

ci: rewrite OpenClaw config based on actual working config schema

d80ab82

Model config uses agents.defaults.model.primary with provider/model format. Credentials go under auth.profiles, not models.providers. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

ci: pass OpenAI credentials via env vars, not auth profile config

f047a2d

auth.profiles only accepts provider and mode — apiUrl and apiKey are loaded from OPENAI_API_KEY and OPENAI_BASE_URL env vars. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

ci: specify --agent main for agent turn command

9b836fc

openclaw agent requires --to, --session-id, or --agent to target a session. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

ci: fix gateway auth and agent API key for E2E test

1abec55

- Use token auth so CLI has operator scope
- Write per-agent auth-profiles.json for OpenAI API key
- Pass OPENCLAW_GATEWAY_TOKEN to agent turn command
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

ci: cache OpenClaw global install to speed up E2E runs

1053d45

openclaw@latest install consistently takes 10+ minutes. Caching the npm global prefix directory cuts that to seconds on cache hits. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

ci: set gateway token and OpenAI creds as job-level env vars

8847ce9

All steps that connect to the gateway need OPENCLAW_GATEWAY_TOKEN. Setting it at job level avoids missing it on health checks and stop steps. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

ci: harden OpenClaw E2E assertions

b5d869c

ci: use an explicit mock responses provider

d5bd1d1

ci: accept batched Opik finalization

7fc9288

ci: handle snake_case Opik end timestamps

9b99a27

vincentkoc merged commit 29e4ad4 into main Apr 21, 2026
1 check passed

vincentkoc deleted the ci/e2e-integration-test branch April 21, 2026 18:30

vincentkoc mentioned this pull request Apr 21, 2026

fix(agents): keep mocked OpenAI Responses on HTTP openclaw/openclaw#69815

Merged

25 tasks

vincentkoc merged 17 commits into main from ci/e2e-integration-test
