Pipelines Gotchas

Non-obvious failure modes (not well covered by docs). For current limits and error semantics, pull https://developers.cloudflare.com/pipelines/platform/limits/.

Events accepted but never appear (most common)

HTTP 200 / send() resolves, but no data in the sink. Causes:

Schema validation failure — structured streams accept then silently drop invalid events during processing. Validate client-side (Zod) and monitor pipelinesUserErrorsAdaptiveGroups.
First-flush warm-up — first data takes 3–7 minutes (warm-up + namespace/table creation) even with --roll-interval 10. Poll ≥5 min in tests.
Roll interval not elapsed — default 300s.
Silent sink failure — deleted bucket or expired token. Check recordsWritten > 0 but filesWritten = 0; inspect failure_reason via GET /pipelines/{id}.

Everything is immutable

Cannot modify stream schema, pipeline SQL, or sink config — delete and recreate. Use version naming (events_v1) and keep SQL in version control.

curl -X DELETE "$BASE_URL/pipelines/{id}" -H "Authorization: Bearer $API_TOKEN"
curl -X DELETE "$BASE_URL/sinks/{id}"     -H "Authorization: Bearer $API_TOKEN"
curl -X DELETE "$BASE_URL/streams/{id}"   -H "Authorization: Bearer $API_TOKEN"

Worker binding undefined (`env.MY_STREAM`)

Use the stream ID, not pipeline ID, in wrangler.jsonc.
Binding field is "stream" (June 2026); old "pipeline" still works.
Redeploy after adding the binding.

REST API field names ≠ CLI flags

r2_data_catalog vs --type r2-data-catalog, table_name vs --table, token vs --catalog-token, and format is required in REST but implied in CLI. See configuration.md.

`wrangler pipelines delete` defaults to "no"

Non-interactive environments answer "no" automatically — use REST DELETE for CI/automation.

Behavioral Notes

__ingest_ts auto-added (TIMESTAMP, day-partitioned). Don't put it in your schema.
Sinks can't target existing tables — the sink creates its own. Use PySpark to write to existing tables.
JSON-only input — no Avro/Protobuf/CSV.
Naming: streams/sinks/pipelines use underscores; buckets use hyphens.
Metrics lag 5–10 min after creation.
Pipeline SQL is row-level only — no GROUP BY/aggregation/window functions (do aggregation in R2 SQL at query time). CTEs and UNNEST are supported.

Debug Checklist

[ ] Stream exists: wrangler pipelines streams list
[ ] Pipeline running (not initializing/failed): GET /pipelines/{id}, check failure_reason
[ ] SQL matches schema; sink token valid; bucket + catalog exist
[ ] Worker redeployed; binding uses stream ID under "stream"
[ ] Waited ≥5 min (first flush)
[ ] Sink metrics: filesWritten > 0; error metrics show no drops

Pipelines Gotchas

Non-obvious failure modes (not well covered by docs). For current limits and error semantics, pull https://developers.cloudflare.com/pipelines/platform/limits/.

Events accepted but never appear (most common)

HTTP 200 / send() resolves, but no data in the sink. Causes:

Schema validation failure — structured streams accept then silently drop invalid events during processing. Validate client-side (Zod) and monitor pipelinesUserErrorsAdaptiveGroups.
First-flush warm-up — first data takes 3–7 minutes (warm-up + namespace/table creation) even with --roll-interval 10. Poll ≥5 min in tests.
Roll interval not elapsed — default 300s.
Silent sink failure — deleted bucket or expired token. Check recordsWritten > 0 but filesWritten = 0; inspect failure_reason via GET /pipelines/{id}.

Everything is immutable

Cannot modify stream schema, pipeline SQL, or sink config — delete and recreate. Use version naming (events_v1) and keep SQL in version control.

curl -X DELETE "$BASE_URL/pipelines/{id}" -H "Authorization: Bearer $API_TOKEN"
curl -X DELETE "$BASE_URL/sinks/{id}"     -H "Authorization: Bearer $API_TOKEN"
curl -X DELETE "$BASE_URL/streams/{id}"   -H "Authorization: Bearer $API_TOKEN"

Worker binding undefined (`env.MY_STREAM`)

Use the stream ID, not pipeline ID, in wrangler.jsonc.
Binding field is "stream" (June 2026); old "pipeline" still works.
Redeploy after adding the binding.

REST API field names ≠ CLI flags

r2_data_catalog vs --type r2-data-catalog, table_name vs --table, token vs --catalog-token, and format is required in REST but implied in CLI. See configuration.md.

`wrangler pipelines delete` defaults to "no"

Non-interactive environments answer "no" automatically — use REST DELETE for CI/automation.

Behavioral Notes

__ingest_ts auto-added (TIMESTAMP, day-partitioned). Don't put it in your schema.
Sinks can't target existing tables — the sink creates its own. Use PySpark to write to existing tables.
JSON-only input — no Avro/Protobuf/CSV.
Naming: streams/sinks/pipelines use underscores; buckets use hyphens.
Metrics lag 5–10 min after creation.
Pipeline SQL is row-level only — no GROUP BY/aggregation/window functions (do aggregation in R2 SQL at query time). CTEs and UNNEST are supported.

Debug Checklist

[ ] Stream exists: wrangler pipelines streams list
[ ] Pipeline running (not initializing/failed): GET /pipelines/{id}, check failure_reason
[ ] SQL matches schema; sink token valid; bucket + catalog exist
[ ] Worker redeployed; binding uses stream ID under "stream"
[ ] Waited ≥5 min (first flush)
[ ] Sink metrics: filesWritten > 0; error metrics show no drops

Cloudflare Platform Skill

references/pipelines/gotchas.md

Pipelines Gotchas

Events accepted but never appear (most common)

Everything is immutable

Worker binding undefined (`env.MY_STREAM`)

REST API field names ≠ CLI flags

`wrangler pipelines delete` defaults to "no"

Behavioral Notes

Debug Checklist

See Also

Preparing the source view

Cloudflare Platform Skill

references/pipelines/gotchas.md

Pipelines Gotchas

Events accepted but never appear (most common)

Everything is immutable

Worker binding undefined (`env.MY_STREAM`)

REST API field names ≠ CLI flags

`wrangler pipelines delete` defaults to "no"

Behavioral Notes

Debug Checklist

See Also

Cloudflare Platform Skill

references/pipelines/gotchas.md

Pipelines Gotchas

Events accepted but never appear (most common)

Everything is immutable

Worker binding undefined (env.MY_STREAM)

REST API field names ≠ CLI flags

wrangler pipelines delete defaults to "no"

Behavioral Notes

Debug Checklist

See Also

Preparing the source view

Cloudflare Platform Skill

references/pipelines/gotchas.md

Pipelines Gotchas

Events accepted but never appear (most common)

Everything is immutable

Worker binding undefined (env.MY_STREAM)

REST API field names ≠ CLI flags

wrangler pipelines delete defaults to "no"

Behavioral Notes

Debug Checklist

See Also

Worker binding undefined (`env.MY_STREAM`)

`wrangler pipelines delete` defaults to "no"

Worker binding undefined (`env.MY_STREAM`)

`wrangler pipelines delete` defaults to "no"