The missing link: explaining ELF static linking, semantically
Abstract
Beneath the surface, software usually depends on complex linker behaviour to work as intended. Even linking hello_world.c is surprisingly involved, and systems software such as libc and operating system kernels rely on a host of linker features. But linking is poorly understood by working programmers and has largely been neglected by language researchers.

In this paper we survey the many use-cases that linkers support and the poorly specified "linker speak" by which they are controlled: metadata in object files, command-line options, and linker-script language. We provide the first validated formalisation of a realistic executable and linkable format (ELF), and capture aspects of the Application Binary Interfaces for four mainstream platforms (AArch64, AMD64, Power64, and IA32). Using these, we develop an executable specification of static linking, covering (among other things) enough to link small C programs (we use the example of bzip2) into a correctly running executable. We provide our specification in Lem and Isabelle/HOL forms. This is the first formal specification of mainstream linking. We have used the Isabelle/HOL version to prove a sample correctness property for one case of AMD64 ABI relocation, demonstrating that the specification supports formal proof, and as a first step towards the much more ambitious goal of verified linking. Our work should enable several novel strands of research, including linker-aware verified compilation and program analysis, and better languages for controlling linking.
Journal ISSN
1558-1160