fix: MIME Decoding corrupts non-ASCII characters in Base64-encoded words by williballenthin · Pull Request #2291 · gchq/CyberChef

williballenthin · 2026-03-24T08:45:02Z

fromBase64() defaults to returning a UTF-8 decoded string, which is then passed to codepage.utils.decode() that treats each char code as a raw byte.
For multi-byte UTF-8 characters, this double-decoding produces garbage (e.g. "café" becomes "caf退").

Pass returnType="byteArray" so codepage receives raw bytes and performs the single correct UTF-8 decode.

Closes #2280

AI disclosure
Claude Code Opus 4.6

fromBase64() defaults to returning a UTF-8 decoded string, which is then passed to codepage.utils.decode() that treats each char code as a raw byte. For multi-byte UTF-8 characters, this double-decoding produces garbage (e.g. "café" becomes "caf退"). Pass returnType="byteArray" so codepage receives raw bytes and performs the single correct UTF-8 decode. Closes gchq#2280

GCHQDeveloper581 added the AI Used label Mar 24, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: MIME Decoding corrupts non-ASCII characters in Base64-encoded words#2291

fix: MIME Decoding corrupts non-ASCII characters in Base64-encoded words#2291
williballenthin wants to merge 1 commit intogchq:masterfrom
williballenthin:fix-2280

williballenthin commented Mar 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

williballenthin commented Mar 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants