Confused about implicit type promotion in C
The following code compiles and prints the message.
#include <stdio.h>
char test_char = 'a';
int test_int = 97;
int main(void)
{
if (test_char == test_int)
{
printf("%s\n", "compared int and char with no issues");
}
}
Even if we compile it with all possible warnings enabled:
/usr/bin/gcc -Wall -Wextra -Wconversion -Werror -Wfloat-equal -Wmissing-noreturn -Wmissing-prototypes -Wsequence-point -Wshadow -Wstrict-prototypes -Wunreachable-code -pedantic -std=c18 -ggdb3
However, no warning or errors are printed, even though we compare int
to char
.
Googling shows that this is called “Implicit type promotion” and looks like compiler can not be configured to catch this behavior, since it is actually a language feature.
Some static analysis tools can potentially detect this. Clang
static analyzer has a check called
Loss of sign/precision in implicit conversions,
which is the closest thing to behavior I would expect.
One can run it by prepending the compilation command with scan-build
. But still, it does not catch all instances of conversion, only
those were loss of precision is happening.
$ scan-build -enable-checker alpha.core.Conversion /usr/bin/gcc -Wall -Wextra -Wconversion -Werror -Wfloat-equal -Wmissing-noreturn -Wmissing-prototypes -Wsequence-point -Wshadow -Wstrict-prototypes -Wunreachable-code -pedantic -std=c18 -ggdb3 test.c -o test.exe
scan-build: Using '/usr/bin/clang-11' for static analysis
scan-build: Analysis run complete.
scan-build: Removing directory '/tmp/scan-build-2020-12-28-174113-109891-1' because it contains no reports.
scan-build: No bugs found.
Coming from OCaml
, it is shocking that this code works, and we need to make an effort to make it stop compiling.
I just had a bug in my code because I wrote:
if (i == '\n')
instead of
if (line[i] == '\n')
And I would like to find out a way to not waste any time on bugs like this :)
SEI CERT C Coding Standard also talks about this, and has list of paid static analysis tools that can catch the behavior to some extent.
Also, when you have wrong specifier for printf
, it also will convert the value for you:
printf("uint range:\t[0;%d]\n", UINT_MAX); // 2^32 = 4 bytes
printf("ulong range:\t[0;%ld]\n", ULONG_MAX); // 2^64 = 8 bytes
Prints:
uint range: [0;-1]
ulong range: [0;-1]
To get the correct values, the specifiers should be u
and lu
instead.
UPD: there are useful gcc options to alert on this behavior:
-Wformat=2
-Wformat-signedness
And also, once you install libusan (dnf install libubsan.x86_64
), you can enable integer overflow
and some other undefined behavior checks with:
-fsanitize=undefined