Redefining Go Functions on Mac OS

March 23, 2026

UPDATE: After publishing this, I found a much simpler way to do the same thing. I’m leaving this post mostly unchanged, with its unnecessary insanity intact. Everything here still works, but it’s more complicated than necessary.

I’ve been building a package to redefine Go functions at runtime (monkey patching, if you like). I started with amd64, but since Macs are such a popular platform for developers, I wanted arm64 support. Lacking Apple hardware, I did the next best thing and ported it to Linux on arm64. That should be enough, right? When I ported the amd64 version to Intel-based Macs, I had to change a single CGO wrapper. Apple silicon should be about the same, right? If only.

The package works by calling mprotect to get write access to the program’s text segment, then it replaces the beginning of a function’s compiled code with a JMP or B instruction to the replacement function (see my last post for the details). It’s unfit for serious programs, but we can use it for unserious ones:

package main

import (
        "fmt"
        "log"
        "strings"

        "github.com/pboyd/redefine"
)

func main() {
        redefine.Func(fmt.Appendf, func(b []byte, format string, v ...any) []byte {
                orig := redefine.Original(fmt.Appendf)

                b = orig(b, format, v...)

                if strings.Contains(strings.ToLower(format), "spanish inquisition") {
                        b = orig(b, "\n\nNOBODY EXPECTS THE SPANISH INQUISITION\n")
                }

                return b
        })

        log.Printf("I didn't expect a kind of Spanish Inquisition")
}

On Darwin/arm64, the mprotect calls always failed with EACCES. I thought the solution must be right there, only I couldn’t find it. All I had to test with was GitHub Actions, and each attempt took a couple of minutes. So I admitted defeat, added a note saying that Darwin/arm64 doesn’t work, and figured that would be that. Leaving it unfinished bothered me, but what was I going to do? Buy a used Mac Mini, a book on ARM assembly, and then spend all my spare time for a few weeks porting dumb joke programs to a platform I don’t even use? Well, yes, apparently that’s what I was going to do.

$ uname -mv
Darwin Kernel Version 25.3.0: Wed Jan 28 20:53:31 PST 2026; root:xnu-12377.91.3~2/RELEASE_ARM64_T8103 arm64
$ go run .
2026/03/24 19:55:23 I didn't expect a kind of Spanish Inquisition

NOBODY EXPECTS THE SPANISH INQUISITION

This post walks through a proof-of-concept implementation for redefining Go functions on Mac OS (Darwin) on arm64. The full source code is on GitHub. It’s lengthy, so I’m only pasting the highlights here.

How Apple broke `mprotect`

On other platforms, we only need to call mprotect for read-write-execute permissions on the program’s text segment (i.e. the memory segment with the executable code). The problem is that for Darwin on arm64, Apple locked it down tight. mprotect, mmap, and their Darwin cousins (mach_vm_protect, mach_vm_allocate, and mach_vm_remap) block every attempt to get read-write access to the text segment. I tried a lot of things, but they all failed, so I eventually abandoned modifying the text segment itself (UPDATE: there was a way, I just didn’t know about it).

Apple did leave one door open for self-modifying code: the MAP_JIT flag to mmap. When combined with the non-standard function pthread_jit_write_protect_np, a thread can swap between read-execute and read-write permissions to MAP_JIT memory. It’s not much to work with, because our text segment isn’t allocated with MAP_JIT and we can’t remap it. To use it, we have to allocate a new text segment.

The plan, then, is to copy the program’s text segment to a new mapping with MAP_JIT, and execute from that copy.

Duplicating the code

Before we can copy the text segment, we need to find it. C provides extern variables for text and etext, but Go—for reasons I cannot fathom—doesn’t give this information easily. But Go’s runtime has this information internally, and we can get a copy of it through linkname:

//go:linkname lastmoduledatap runtime.lastmoduledatap
var lastmoduledatap *moduledata

type moduledata struct {
        // [snip]

        text, etext           uintptr
        noptrdata, enoptrdata uintptr
        data, edata           uintptr
        bss, ebss             uintptr
        noptrbss, enoptrbss   uintptr
        covctrs, ecovctrs     uintptr
        end, gcdata, gcbss    uintptr
        types, etypes         uintptr
        rodata                uintptr
        gofunc                uintptr // go.func.*
        epclntab              uintptr

        // [snip]
}


linkname binds that variable to Go’s internal moduledata. There’s no published definition of that struct, so we have to copy the definition from Go’s source code. The version above is from Go 1.26, which differed slightly from 1.25. This is brittle, and Go could break it tomorrow, but it’s good enough for today.

The text segment runs from the address in text to etext (“end text”), but we need to copy from text to rodata because, as I discovered the hard way, the linker places cgo stubs between etext and rodata.

Now we can allocate a new text segment and copy all the machine code:

func duplicateText() (uintptr, error) {
	text := lastmoduledatap.text & pageMask
	etext := (lastmoduledatap.rodata + pageSize - 1) & pageMask

	destPtr, err := unix.MmapPtr(
		-1, 0,
		unsafe.Pointer(lastmoduledatap.end),
		etext-text,
		unix.PROT_READ|unix.PROT_WRITE|unix.PROT_EXEC,
		unix.MAP_ANON|unix.MAP_PRIVATE|unix.MAP_JIT,
	)
	if err != nil {
		return 0, fmt.Errorf("mmap JIT text (%d bytes): %w", etext-text, err)
	}

	cgo.JITWriteStart()
	defer cgo.JITWriteEnd()

	src := unsafe.Slice((*byte)(unsafe.Pointer(text)), etext-text)
	dest := unsafe.Slice((*byte)(destPtr), etext-text)
	copy(dest, src)

	cgo.ClearCache(dest)

	return uintptr(destPtr) - text, nil
}

The mmap call gets read-write-execute permissions because Apple has its own protection mechanism with pthread_jit_write_protect_np; layering the standard Unix memory protections on top is unnecessary. The JITWriteStart and JITWriteEnd calls are thin cgo wrappers around pthread_jit_write_protect_np.

This function returns the offset to add to an address in the old text segment to get the equivalent address in the new text segment.
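The page rounding at the top of duplicateText depends on pageSize and pageMask, which the excerpt doesn't define. Here is a minimal sketch of that arithmetic, assuming 16 KiB pages (the page size on Apple silicon) and a mask that clears the low bits. The helper names alignDown and alignUp and the addresses are made up for illustration.

```go
package main

import "fmt"

// Assumption: 16 KiB pages. The excerpt above does not show how pageSize and
// pageMask are defined, so these definitions are a guess at their shape.
const (
	pageSize = uintptr(16 * 1024)
	pageMask = ^(pageSize - 1)
)

// alignDown rounds addr down to the start of its page.
func alignDown(addr uintptr) uintptr { return addr & pageMask }

// alignUp rounds addr up to the next page boundary, leaving an address that
// is already aligned unchanged.
func alignUp(addr uintptr) uintptr { return (addr + pageSize - 1) & pageMask }

func main() {
	// Hypothetical addresses, chosen only to exercise the rounding.
	text := uintptr(0x100001234)
	rodata := uintptr(0x1000f5010)

	start, end := alignDown(text), alignUp(rodata)
	fmt.Printf("copy %#x..%#x (%d bytes)\n", start, end, end-start)
}
```

Rounding the start down and the end up means the copy always covers whole pages, which is what mmap hands out anyway.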

With a little pointer tomfoolery, we can use the offset to call simple functions that we’ve copied:

func main() {
        offset, err := duplicateText()
        if err != nil {
                log.Fatal(err)
        }

        dupTestFunc := offsetFunc(testFunc, offset)
        fmt.Println(dupTestFunc(2))
        // Prints 4
}

func testFunc(x int) int {
        return x * 2
}

var refs []any

// offsetFunc takes the address of fn and adds offset to it, then derefs that
// address as a function of the same type.
func offsetFunc[T any](fn T, offset uintptr) T {
	fnv := reflect.ValueOf(fn)
	if fnv.Kind() != reflect.Func {
		panic("not a function")
	}

	ptr := new(uintptr)
	*ptr = fnv.Pointer() + offset
	refs = append(refs, ptr)

	return *(*T)(unsafe.Pointer(&ptr))
}

Unfortunately, it only works for trivial functions. This variation probably crashes:

var multiplier int = 2

func testFunc(x int) int {
        return x * multiplier
}

The problem is that multiplier is stored in static data. testFunc disassembles to:

ADRP 1003520(PC), R27                // adrp x27, .+0xf5000
MOVD 1584(R27), R1                   // ldr x1, [x27,#1584]
MUL R1, R0, R0                       // mul x0, x0, x1
RET                                  // ret

That ADRP instruction loads the address of the memory page containing multiplier. But ADRP is relative to the address of the instruction (stored in the program counter, or PC, register). Now that we’ve moved the code, pc+0xf5000 is probably pointing at unallocated space, so the program crashes. Or it’s pointing at allocated memory, which is unlikely to hold the value 2, so you get the wrong answer.
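The address arithmetic is easier to follow with numbers. This is a small sketch with hypothetical addresses (the instruction address and the 32 MiB distance to the copy are invented). adrpTarget and rebaseADRP are illustrative helpers, not part of the package.

```go
package main

import "fmt"

// adrpTarget returns the page address an ADRP at pc resolves to, given its
// immediate expressed in bytes (a multiple of 4096). ADRP always works in
// 4 KiB units, regardless of the system page size.
func adrpTarget(pc uint64, immBytes int64) uint64 {
	return uint64(int64(pc&^0xfff) + immBytes)
}

// rebaseADRP returns the immediate (in bytes) that an ADRP copied from srcPC
// to destPC needs in order to keep pointing at the original target page.
func rebaseADRP(srcPC, destPC uint64, immBytes int64) int64 {
	target := adrpTarget(srcPC, immBytes)
	return int64(target) - int64(destPC&^0xfff)
}

func main() {
	// Hypothetical layout: the copy sits 32 MiB above the original.
	srcPC := uint64(0x100001230)
	destPC := srcPC + 0x2000000
	imm := int64(0xf5000)

	fmt.Printf("original target: %#x\n", adrpTarget(srcPC, imm))
	fmt.Printf("stale target:    %#x\n", adrpTarget(destPC, imm))

	newImm := rebaseADRP(srcPC, destPC, imm)
	fmt.Printf("fixed immediate: %#x -> %#x\n", newImm, adrpTarget(destPC, newImm))
}
```

The rebased immediate still has to fit ADRP's signed 21-bit page count, roughly ±4 GiB, so the copy can't be mapped arbitrarily far from the data it refers to.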

To solve that problem, we need to walk through the copied text segment and update the arguments to the ADRP instructions to point to the same data relative to the new address. golang.org/x/arch/arm64/arm64asm makes parsing (although not encoding) the instructions easy.

const adrAddressMask = uint32(3<<29 | 0x7ffff<<5)

func fixADRP(code []byte, offset uintptr) {
	destBase := uintptr(unsafe.Pointer(unsafe.SliceData(code)))
	srcBase := destBase - offset

	for i := uintptr(0); i < uintptr(len(code)); i += 4 {
		raw := code[i : i+4]
		inst, _ := arm64asm.Decode(raw)

		destPC := destBase + i
		srcPC := srcBase + i

		switch inst.Op {
		case arm64asm.ADRP:
			oldArg := int64(inst.Args[1].(arm64asm.PCRel))
			newArg := uint32((int64(srcPC&^uintptr(0xfff)) + oldArg - int64(destPC&^uintptr(0xfff))) >> 12)

			encoded := binary.LittleEndian.Uint32(raw) &^ adrAddressMask
			encoded |= (newArg & 3) << 29             // Lowest 2 bits to bits 30 and 29
			encoded |= ((newArg >> 2) & 0x7ffff) << 5 // Highest 19 bits to bits 23 to 5
			binary.LittleEndian.PutUint32(raw, encoded)

		}
	}
}


BL (CALL in Go assembly) also takes a PC-relative address and would normally need the same adjustment. But since we copied the entire text segment, those addresses point to their equivalent function in the copy.

With that in place, our duplicated functions can use static data.

The last major problem with our duplicated text segment only happens when there’s a panic. If our test function were instead:

var divisor int = 0

func testFunc(x int) int {
        return x / divisor
}

As you probably expect, it crashes, but instead of the familiar divide-by-zero panic, the runtime reports unknown pc. The processor gladly executes instructions from our new text segment, but the Go runtime doesn’t know what’s at those addresses. To fix it, we need to register a new “module”:

var newModdata moduledata

func duplicateText() (uintptr, error) {
	// ... same as before, up to and including copy(dest, src) ...

	offset := uintptr(destPtr) - text
	fixADRP(dest, offset)

	cgo.ClearCache(dest)

	newModdata = *lastmoduledatap
	newModdata.text += offset
	newModdata.etext += offset
	newModdata.minpc += offset
	newModdata.maxpc += offset

	newPcHeader := *lastmoduledatap.pcHeader
	newPcHeader.textStart += offset
	newModdata.pcHeader = &newPcHeader

	newModdata.textsectmap = make([]textsect, len(lastmoduledatap.textsectmap))
	for i := range lastmoduledatap.textsectmap {
		newModdata.textsectmap[i] = lastmoduledatap.textsectmap[i]
		newModdata.textsectmap[i].baseaddr += offset
	}

	lastmoduledatap.next = &newModdata

	return uintptr(destPtr) - text, nil
}

Our module shares the same data segments as the original one, so we copy that one and update the text addresses. The module data is stored in a singly linked list, so we insert our copy as next on lastmoduledatap. It’s important that our moduledata is statically allocated rather than on the heap, so the garbage collector can never free it. Once that’s in place, we get a much more normal stack trace:

panic: runtime error: integer divide by zero

goroutine 1 [running]:
main.testFunc(0x3ab69e4ce668?)
        /Users/pboyd/dev/redefine-macos-poc/redefine.go:115 +0x38
main.fork()
        /Users/pboyd/dev/redefine-macos-poc/redefine.go:89 +0x84
main.redefineFunc[...](0x10482f130, 0x10482f120)
        /Users/pboyd/dev/redefine-macos-poc/redefine.go:21 +0x28
main.main()
        /Users/pboyd/dev/redefine-macos-poc/main.go:14 +0x34
exit status 2

It has the original source code lines but with addresses from the new text segment.
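To see why registering the module matters, here is a toy model of a PC lookup over a linked list of modules. It is a simplification built on assumptions: the module type and find function are stand-ins I made up, and the real runtime structures carry far more state than two bounds and a next pointer.

```go
package main

import "fmt"

// module is a toy stand-in for runtime.moduledata. Only the fields needed
// for a range lookup are modeled.
type module struct {
	name         string
	minpc, maxpc uintptr
	next         *module
}

// find walks the list and returns the first module whose [minpc, maxpc)
// range contains pc, or nil if no module claims the address.
func find(first *module, pc uintptr) *module {
	for m := first; m != nil; m = m.next {
		if m.minpc <= pc && pc < m.maxpc {
			return m
		}
	}
	return nil
}

func main() {
	const offset = 0x2000000 // hypothetical distance to the copy
	orig := &module{name: "original", minpc: 0x100001000, maxpc: 0x1000f0000}
	pc := uintptr(0x100001230 + offset) // a PC inside the duplicated text

	fmt.Println(find(orig, pc) == nil) // before registration nobody owns pc

	// Copy the module, shift its bounds, and append it to the list.
	dup := *orig
	dup.name = "duplicate"
	dup.minpc += offset
	dup.maxpc += offset
	orig.next = &dup

	fmt.Println(find(orig, pc).name)
}
```

Before the copy is linked in, the lookup comes back empty, which is the toy equivalent of unknown pc. Afterwards the shifted range claims the address.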

The full duplicateText source

Switching to the duplicate

Now we have a functioning copy of the program text, but what good is that? The program is still running from the original read-only text segment, not our read-write duplicate. To solve this, we need to know two things about Arm assembly.

First, subroutine calls. On Arm, subroutines are called with the BL instruction. It does an unconditional branch (like B), and stores the return address in the link register (lr). The RET instruction jumps to the address in lr, so if we can update those lr addresses, we can switch our program to run anything.

Second, the stack. Each (non-trivial) function gets a stack frame, which spans from the address in the frame pointer (fp) to the stack pointer (sp). Arm uses register x29 as the frame pointer. Go’s function preamble stores the original fp value on the stack before setting fp to its new value. So the stack frame, at its most basic, looks like this:

-------------------
|     prev fp     | <- fp
-------------------
|      ...        |
-------------------
|   last item     | <- sp
-------------------

Because the link register only holds one value, it must be saved before making function calls. The normal convention, which Go follows, is to push lr onto the stack immediately after the frame pointer:

-------------------
|     prev fp     | <- fp
-------------------
|       lr        |
-------------------
|      ...        |
-------------------
|   last item     | <- sp
-------------------

If we get the frame pointer, we can walk back up the stack and shift the return addresses to our copy. We need a small assembly function to get the fp value:

TEXT ·getFrame(SB),NOSPLIT,$0-8
    MOVD R29, ret+0(FP)
    RET

And then a little Go code to interpret the addresses:

type frame struct {
        next *frame
        lr   uintptr
}

func getFrame() *frame

Then we can adjust return addresses all the way back up the call stack:

for f := getFrame(); f != nil; f = f.next {
        if f.lr >= origText && f.lr < origEtext {
                f.lr += offset
        }
}
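The same loop can be tried out without touching a real stack. The sketch below builds a fake chain of frame records on the heap and runs the adjustment over it. The addresses are invented, and shiftReturns is a hypothetical wrapper around the loop above.

```go
package main

import "fmt"

// frame mirrors the two-word record described above: the saved frame pointer
// followed by the saved link register.
type frame struct {
	next *frame
	lr   uintptr
}

// shiftReturns adds offset to every return address that falls inside
// [text, etext) and reports how many it changed.
func shiftReturns(top *frame, text, etext, offset uintptr) int {
	n := 0
	for f := top; f != nil; f = f.next {
		if f.lr >= text && f.lr < etext {
			f.lr += offset
			n++
		}
	}
	return n
}

func main() {
	// Hypothetical values standing in for a three-deep call stack.
	const text, etext, offset = 0x100001000, 0x1000f0000, 0x2000000
	outer := &frame{lr: 0x100002000}
	middle := &frame{next: outer, lr: 0x7ff000000} // outside the text range
	inner := &frame{next: middle, lr: 0x100003000}

	fmt.Println(shiftReturns(inner, text, etext, offset))
	for f := inner; f != nil; f = f.next {
		fmt.Printf("%#x\n", f.lr)
	}
}
```

The range check is what keeps the walk from corrupting return addresses that don't point into the original text, such as the middle frame here.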

Unfortunately, this only rewrites the call stack of the current goroutine. But once we’re running from the duplicate text, any new goroutines will also start in the duplicate. For best results, switch the main goroutine early, before starting any other goroutines.

The major failing of this approach is function pointers.

func main() {
        redefineFunc(time.Now, myTimeNow)

        fmt.Println(time.Now().Format(time.Kitchen))
        // Prints 5:00PM

        f := func() {
                fmt.Println(time.Now().Format(time.Kitchen))
        }
        f()
        // Prints the real time

        f() // Call it again to avoid inlining
}

f is a pointer to an anonymous function stored in rodata memory, so we need to search that memory for function pointers and update them. See patchRodataCodePtrs for details.
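The general idea can be sketched as a scan over pointer-sized words. To be clear, this is not patchRodataCodePtrs. It is a simplified model that runs over an ordinary writable slice, with a made-up function name and made-up addresses.

```go
package main

import "fmt"

// patchCodePtrs scans words for values that look like addresses inside
// [text, etext) and shifts them by offset. It returns the number of words
// changed. Any word that merely happens to fall in that range is shifted
// too, so a real implementation needs a stricter test than this one.
func patchCodePtrs(words []uintptr, text, etext, offset uintptr) int {
	n := 0
	for i, w := range words {
		if w >= text && w < etext {
			words[i] = w + offset
			n++
		}
	}
	return n
}

func main() {
	const text, etext, offset = 0x100001000, 0x1000f0000, 0x2000000
	// Hypothetical read-only data: two code pointers mixed with other values.
	rodata := []uintptr{0x100002000, 42, 0x1000a0000, 0x7ff000000}

	fmt.Println(patchCodePtrs(rodata, text, etext, offset))
	fmt.Printf("%#x\n", rodata)
}
```

The false-positive risk is the catch with any scan like this: an integer constant that happens to look like a text address gets "fixed" along with the real pointers.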

Patching functions

With the preliminaries out of the way, we can finally patch a function. Unfortunately, pthread_jit_write_protect_np(0) switches the thread from having read-execute permissions on MAP_JIT memory to read-write permissions. In other words, the thread that writes can’t itself be executing from MAP_JIT memory. The simplest solution I’ve found is to switch back to the original text section when updating the code.

var writeB func([]byte, int32) = _writeB

func _writeB(buf []byte, relAddr int32) {
	cgo.JITWriteStart()

	// Encode the instruction:
	// -----------------------------------
	// | 000101 | ... 26 bit address ... |
	// -----------------------------------
	inst := (5 << 26) | (uint32(relAddr>>2) & (1<<26 - 1))
	binary.LittleEndian.PutUint32(buf, inst)

	cgo.ClearCache(buf)

	cgo.JITWriteEnd()
}

Then, after we patch function pointers, we switch writeB so that it points back into the original text segment (offsetFunc was defined earlier in this post, under “Duplicating the code”).

writeB = offsetFunc(writeB, -offset)
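The instruction encoding in _writeB can be checked on its own, away from any JIT memory. This sketch pulls the bit manipulation into a pure function and adds a decoder for a round trip. encodeB and decodeB are names I made up for the example.

```go
package main

import "fmt"

// encodeB builds an unconditional B instruction for a byte offset relative
// to the instruction's own address. The offset must be a multiple of 4 and
// fit in a signed 28-bit range (about +/-128 MiB).
func encodeB(relAddr int32) uint32 {
	return 5<<26 | uint32(relAddr>>2)&(1<<26-1)
}

// decodeB recovers the byte offset from a B instruction by sign-extending
// the 26-bit word offset and scaling it back to bytes.
func decodeB(inst uint32) int32 {
	imm26 := int32(inst<<6) >> 6 // drop the opcode, then sign-extend
	return imm26 << 2
}

func main() {
	for _, rel := range []int32{4, -8, 0x2000000} {
		inst := encodeB(rel)
		fmt.Printf("%#08x -> %d\n", inst, decodeB(inst))
	}
}
```

The ±128 MiB reach of B is worth keeping in mind: a replacement function further away than that from the patched one can't be reached with a single instruction.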

With that, we’ve reached the end. That’s the only way I found to monkey patch Go functions on a recent Mac. It works in all scenarios that I’ve thought to test, but given the number and severity of the bugs encountered, I’ve surely missed something. Don’t use these techniques for anything serious (unless you work for ByteDance, perhaps). I suppose I should close by thanking Apple for making this all possible: without you, my original program would just work.
