Recommended use of C-language types
As confusion frequently arises about the most appropriate types to use for integers, pointers, and pointer-related values, we make the following recommendations:
-
int_t
,int32_t
,long_t
,int64_t
, ...: These pure integer types should be used to hold integer values that will never be cast to a pointer type without first combining them with another pointer value — e.g., by using them as an array offset. Most integers in a C/C++-language program will be of these types. -
ptraddr_t
: This is a new integer type introduced by CHERI C and should be used to hold addresses.ptraddr_t
should not be directly cast to a pointer type for dereference; instead, it must be combined with an existing valid capability to the address space to generate a dereferenceable pointer. Typically, this is done using thecheri_address_set(c, x)
function. -
size_t
,ssize_t
: These integer types should be used to hold the unsigned or signed lengths of regions of address space.
-
ptrdiff_t
: This integer type describes the difference of indices between two pointers to elements of the same array, and should not be used for any other purpose. It can be added to a pointer to obtain a new pointer, but the result will be dereferenceable only if the address lies within the bounds of the pointer from which it was derived.Less standards-compliant code sometimes uses
ptrdiff_t
when the programmer more likely meantintptr_t
or (less commonly)size_t
. When porting code, it is worthwhile to audit use ofptrdiff_t
. -
intptr_t
,uintptr_t
: These integer types should be used to hold values that may be valid pointers if cast back to a pointer type. When anintptr_t
is assigned an integer value — e.g., due to constant initialization to an integer in the source — and the result is cast to a pointer type, the pointer will be invalid and hence non-dereferenceable. These types will be used in two cases: (1) Where there is uncertainty as to whether the value to be held will be an integer or a pointer — e.g., for an opaque argument to a callback function; or (2) Where it is more convenient to place a pointer value in an integer type for the purposes of arithmetic (which takes place on the capability's address and in units of bytes, as if the pointer had been cast tochar *
).The observable, integer range of a
uintptr_t
is the same as that of aptraddr_t
(orptrdiff_t
forintptr_t
), despite the increased alignment and storage requirements. -
intmax_t
,uintmax_t
: According to the C standard, these integer types should be capable of representing any value of any (unsigned) integer type. In CHERI C/C++, they are not provenance-carrying and can represent the integer range ofuintptr_t
/intptr_t
, but not the capability metadata or tag bit. As the observable value ofintptr_t
/intptr_t
is the pointer address range, we believe this choice to be compatible with the C standard.Additionally, due to ABI constraints, it would be extremely difficult to change the width of these types from 64 to 129 bits. This is also true for other architectures such as x86: despite Clang and GCC supporting an
__int128
type,intmax_t
remains 64 bits wide.We generally do not recommend use of these types in CHERI C/C++. However, the types may be useful in
printf
calls (using the%j
format string width modifier) as theinttypes.h
PRI*
macros can be rather verbose. -
maxalign_t
: This type is defined in C as an object type whose alignment is the greatest fundamental alignment and this includes capability types for CHERI C/C++. We found that some custom allocators usesizeof(long double)
orsizeof(uint64_t)
to align their return values. While this appears to work on most architectures, in CHERI C/C++ this must be changed toalignof(maxalign_t)
.1 -
char *
, ...: These pointer types are suitable for dereference, but in general should not be cast to or from arbitrary integer values. Valid pointers are always derived from other valid pointers (including those cast tointptr_t
oruintptr_t
), and cannot be constructed using arbitrary integer arithmetic.
It is important to note that uintptr_t
is no longer the same size as
size_t
. This difference may require making some changes to
existing code to use the correct type depending on whether the variable
needs to be able store a pointer type. In cases where this is not obvious
(such as for a callback argument), we recommend the use of uintptr_t
.
This ensures that provenance is maintained.
It is important to use alignof
instead of sizeof
since many
common implementations, such as GCC and FreeBSD, define maxalign_t
as a
struct
and not a union
.