4 billion if statements (2023)

(andreasjhkarlsson.github.io)

299 points | by damethos 6 days ago

33 comments

xnorswap 12 minutes ago
This is time efficient* but rather wasteful of space.
The best way to save space is to use a Bloom Filter.
If we capture all the even numbers, that would sadly only give us "Definitely not Even" or "Maybe Even".
But for just the cost of doubling our space, we can use two Bloom filters!
So we can construct one bloom filter capturing even numbers, and another bloom filter capturing odd numbers.
Now we have "Definitely not Even" and "Maybe Even" but also "Definitely not Odd" and "Maybe Odd".
In this manner, we can use the "evens" filter to find the odd numbers and the "odds" filter to find the even numbers.
Having done this, we'll be left with just a handful of unlucky numbers that are recorded as both "Maybe even" and "Maybe odd". These will surely be few enough in number that we can special case these in our if/else block.
The filters as a first-pass will save gigabytes of memory!
travisgriggs 33 minutes ago
I see this is 2023… the article refs GPT even then. Can’t believe it’s already that much time gone by, still seems like “last years big news”
I was gonna comment “this is what I really like to see on HN”. Then I saw the date and was sad that we’re having to dip into the history box to find fun/interesting articles more often of late it seems.
Anyone else interested in a temporary moratorium on all things LLM? We could have GPT-free-Wednesday, or something like that :)
[-]
- virgilp 1 minute ago
  Definitely not a visionary. This is how you do it in 2025: https://imgur.com/rWiP90P
- LPisGood 5 minutes ago
  > Then I saw the date and was sad that we’re having to dip into the history box to find fun/interesting articles more often of late it seems
  We don’t _have_ to. You could start a blog and display the indomitable human spirit.
ZeroConcerns 4 hours ago
Ah, yes, exactly the pointless diversion I needed for my lunch break. For science: generating a C# switch statement for similar purposes took 7 minutes on similar-ish hardware, but the resulting 99.2GB file could not be opened or compiled ('Stream is too long'), which was slightly disappointing.
Optimization efforts included increasing the internal buffer size of the StreamWriter used to create the source code: this reduced the runtime to around 6 minutes, as well as running a non-debug build, as it was observed that the poor Visual Studio metrics gathering process was contributing significantly to disk activity as well, but that ultimately didn't matter much. So, ehhm, yes, good job on that I guess?
[-]
- afandian 3 hours ago
  Isn't the obvious thing to generate different classes for different ranges over the input? Then the classes could be loaded lazily.
  And if you then make the ranges tree-shaped you get logarithmic complexity, which massively cuts down the O(n) of the rather naive chained `if` statements.
- opticfluorine 2 hours ago
  I wonder if you could generate it via a Roslyn incremental source generator instead of as a file to bypass this limit. I'm guessing not, but it does sound like fun.
  [-]
  - tkapin 12 minutes ago
    You can totally use source generators for that.
- szundi 44 minutes ago
  [dead]
mft_ 2 hours ago
Similar humour if opposite directions to an old favourite: https://joelgrus.com/2016/05/23/fizz-buzz-in-tensorflow/
[-]
- thih9 1 hour ago
  I expected some job interview meme[1][2] but I did not know this one and it looks like a real story too! Thanks for sharing, that was a fun read.
  [1]: https://aphyr.com/posts/342-typing-the-technical-interview
  [2]: https://www.richard-towers.com/2023/03/11/typescripting-the-...
blauditore 1 hour ago
This could be obviously done with much less code: Just add "if"s for all even number, and at the end just return "odd" if none of the evens matched. 50% less code!
Or even simpler: If it's 0, return "even". If not, do a recursive call to n-1, if that equals "even", return "odd", otherwise return "even".
But the best way is probably to just use a library. Yes, 500MB of additional dependencies, but then it's a one-liner.
[-]
- majkinetor 1 hour ago
  But then, even numbers will have the worst possible performance.
- layer8 1 hour ago
  You could save stack space by transforming it into a loop. It’s still only O(n)!
wiz21c 4 hours ago
Gemini took 4 seconds to answer this prompt: "Here is a number 4200020010101. Think deeply about it and tell me if it is not or or not even."
So if you're concerned with privacy issues, you can run the assembly version proposed in the article locally and be well within the same order of performance.
Let's thank the author of the article for providing a decent alternative to Google.
ah, but the license is not that good we can't reproduce his code.
[-]
- brap 3 hours ago
  >Think deeply about it and tell me if it is not or or not even
  I think I just experienced a segfault
  [-]
  - sethaurus 2 hours ago
    Why do they call it even when you of in the true number of out false odd the number?
    [-]
    - pksebben 1 hour ago
      Has anyone really been far even as decided to use even go want to do Look more like?
  - iberator 39 minutes ago
    Hey, why segfault and not stack overflow?
  - bdangubic 2 hours ago
    +1
    [-]
    - rossant 2 hours ago
      Nice, that swaps odd and even around.
- classified 2 hours ago
  > if it is not or or not even
  Did you want to test the LLM's grammatical comprehension?
- moffkalast 1 hour ago
  Finally a problem that Microsoft Phi can ace. Probably. Maybe. Some of the time at least.
xg15 4 hours ago
> Now, this is a time-memory tradeoff, but my time on this earth is limited so I decided to meta-program the if statements using a programmer program in a different programming language.
```
  for i in range(2*8):
    if i % 2 == 0:
```
No comment...
[-]
- _ache_ 4 hours ago
  Yeah... I come here to talk about that. Should have been
```
  for i in range(0, 2**8, 2):
      print("    if (number == "+str(i)+")")
      print("        printf(\"even\\n\");")
      print("    if (number == "+str(i + 1)+")")
      print("        printf(\"odd\\n\");")
```
  or
```
  for i in range(0, 2**8, 2):
      print(f"""    if (number == {i})
          puts("even");
      if (number == {i + 1})
          puts("odd");""")
```
  [-]
  - NetMageSCW 1 hour ago
    What happens when you try to compute 2**8+1 ?
    [-]
    - iberator 38 minutes ago
      Should work fine with long long?
- PurpleRamen 3 hours ago
  I think we can improve this. Just make a microservice who generates the code on the fly and streams it to the compiler. Then you also just have to create the necessary code and don't waste the SSD with unused code-paths.
  [-]
  - travisgriggs 29 minutes ago
    I’m disappointed there is no docker image for this. How will I test it out?
- cowsandmilk 4 hours ago
  How horridly inefficient, he should have just flipped a Boolean on each iteration.
  [-]
  - kroolik 4 hours ago
    Horridly inefficient. Just unfold the loop.
  - projektfu 3 hours ago
    Just output odd and even for each pass and increment by two. Need to make sure you have the right starting value, and check for off-by-one errors.
- layer8 1 hour ago
  It should have used a flag that is being toggled.
mring33621 1 hour ago
I recently asked a Qwen model to write me some code to remove errant spaces ("c he es e" instead of "cheese") in OCR'd text. It proceeded to write 'if' blocks for every combo of all English words and all possible errant spaces. I did not wait for it to finish...
oneeyedpigeon 3 hours ago
I prefer data-driven programming, so a simple:
```
    return odd_or_evenness[n];
```
works for me, alongside a pretty big array.
[-]
- layer8 1 hour ago
  With data-driven programming, I would have expected an SQL table containing all the precomputed results. Unless you carelessly add an index, it has the same asymptotic performance!
- projektfu 3 hours ago
  You might like the IBM 1620.
  https://en.wikipedia.org/wiki/IBM_1620#Anecdotes
- Zambyte 1 hour ago
```
    const odd_or_evenness = comptime blk: {
        var buf: [16]bool = undefined;
        var is_even = false;
        for (&buf) |*b| {
            b.* = is_even;
            is_even = !is_even;
        }
        break :blk buf;
    };
```
  This looks like a promising strategy.
- enopod_ 3 hours ago
  This is the way.
nonethewiser 9 minutes ago
Configuration over logic. At a new scale enabled by AI.
orzig 4 hours ago
If the author is available for consulting I have this bag of rice I need cooked. Should be around 30,000 grains, each needs about 1mL of water and 2m on the stove. Will pay $10 (2025 dollars)
[-]
- franciscop 3 hours ago
  Aha, you forgot to specify the country of those "dollars"! For $10 (2025 [Cayman Islands] dollars)! Which is higher than USD10
  [-]
  - 867-5309 3 hours ago
    Zimbabwe: 0.027 USD
jagged-chisel 3 hours ago
I have never seen anyone argue for a ‘switch’ version.
```
    switch (v) {
     case: 0,2,4,8,…:
       return EVEN;
     case: 1,3,5,7,…:
       return ODD;
     default:
       return IDK;
    }
```
Slightly less code to generate.
[-]
- willguest 3 hours ago
  you forgot the logic to strip the final digit and assign it to v.
  processing the whole number is absurd
  [-]
  - pkaeding 2 hours ago
    I think the idea is to fill in the ellipses with even/odd numbers, up to 4B.
    You know, to save the performance cost of processing the input as a string, and chomping off all but the last character.
  - mgaunard 2 hours ago
    Converting to decimal is just as absurd.
    All you need is the final binary digit, which incidentally is the most optimal codegen, `v & 1`.
  - 71bw 2 hours ago
    Look at Mr. Rocket Scientist over here...
thewisenerd 5 hours ago
discussed 2 years ago,
https://news.ycombinator.com/item?id=38790597
4B If Statements (469 comments)
[-]
- unwind 5 hours ago
  Meta: Yeah, this should have a "(2023)" tag in the title. Thanks.
isoprophlex 5 hours ago
> Visionary genius Ross van der Gussom
Thanks for making me doubt myself & googling who that guy who made python was again, because surely "van der Gussom" isn't a normal Dutch name. Well played.
[-]
- mcny 3 hours ago
  Previously on HN
  https://news.ycombinator.com/item?id=38790754
riwsky 59 minutes ago
Would be more maintainable if they injected the loading strategy to be used as a dependency from config instead of hardcoding it :/
SiempreViernes 5 hours ago
> As a side note, the program is amazingly performant. For small numbers the results are instantaneous and for the large number close to the 2^32 limit the result is still returned in around 10 seconds.
Amazing!
mgaunard 2 hours ago
A much cooler approach would have been to generate the ASM from the same program, rather than generate a file from python and load that file from C++. The multi-stage build and filesystem are completely unnecessary.
The technique actually has a lot of practical applications, so it's useful to have a C++ library that helps you with generating amd64 machine code.
whynotmaybe 4 hours ago
> I decided to implement this in the C programming language as it’s by far the fastest language on the planet to this day (thanks to the visionary genius Dennis Richie)
Am I lost? Aren't the compiler/linker responsible for fast code, not the language itself?
[-]
- rcxdude 3 hours ago
  Both, usually. A language's semantics can limit how much a compiler can speed up the language. Python, for example, is extremely difficult to make fast due to the fact that almost everything has the semantics of a hashmap lookup. C, in comparison, has relatively little in it that can't be mapped fairly straightforwardly to assembly, and then most of it can be mapped in a more difficult way to faster assembly.
- mbivert 3 hours ago
  > I decided to use the slowest language on the planet, Python (thanks to the visionary genius of Ross van der Gussom).
  given the article, it's fair to assume the author was joking around
  that being said, the way the language is used and its ecosystem do contribute to the executable's efficiency. yet, given C's frugality, or the proximity between its instructions and the executed ones, it's not unfair to say that "C is fast"
- bluGill 1 hour ago
  There are language issues as well. 99% of C programs are valid C++, and if you compile with a C++ compiler instead of a C++ compiler will be slightly faster! C++ ha a stronger type system and so once in a while (very rarely) those C programs compile but give incorrect results since C++ allowed the optimizer to make an assumption that wasn't true. Fortran is often even faster because the language allows for even more assumptions. I don't know where Rust fits in here (Rust is hurt today because the better optimizes are designed for C++ and so don't take advantage of extra assumptions Rust allows - it was designed to allow different assumptions from C++ and likely could be better would a ground up optimizer but that would take a large team a decade+ to write: expensive)
  Most of the difference in speed is the optimizer/linker. Assuming a fair competition the difference between Ada, C, C++, D, Fortran, Rust, Zig (and many others I can't think of) is very rarely even 1% in any real world situation. Of course it isn't hard add pessimization to make any language lose, sometimes accidentally, so fair competitions are very hard to find/make.
  Then again, the article was clearly sarcastic.
- reedf1 4 hours ago
  It's a wild statement for a few reasons; your observation is one of them.
- croes 4 hours ago
  People often use language as synonym for the whole ecosystem including compiler and linker
philippta 4 hours ago
> I saw from the SSD was around 800 MB/s (which doesn’t really make sense as that should give execution speeds at 40+ seconds, but computers are magical so who knows what is going on).
If anyone knows what’s actually going on, please do tell.
[-]
- ricardo81 3 hours ago
  Presumably after the first run much or all of the program is paged into OS memory
  [-]
  - tomtomtom777 2 hours ago
    Yes, or it was still in memory from writing.
    The numbers match quite nicely. 40gb program size minus 32gb RAM is 8gb, divided by 800mb/s makes 10 seconds.
- hellzbellz123 3 hours ago
  I'm not entirely sure but could it be predictive branching?
1a527dd5 3 hours ago
I love "stupid" stuff like this; you normally learn something small and seemingly inane. It's fun!
layer8 1 hour ago
This is also a nice approach for FizzBuzz in leetcode interviews.
Moreover, interviewers will be smitten by the hand-optimized assembly code.
KeplerBoy 4 hours ago
kind of expected gcc to see right through the 300 gigs of code and compile it down to the tenish instructions.
[-]
- bspammer 3 hours ago
  They disabled optimisations:
  > Lets compile the code, disabling optimizations with /Od to make sure that the pesky compiler doesn’t interfere with our algorithm
  [-]
  - bluGill 1 hour ago
    I would have wanted to see them look at the assembly from various optimization levels to see if the compiler really did. Ideally o1 or something wouldn't see through this but would generate better code in other ways. Disabled optimizations often are really stupid about how they code gen.
- oneeyedpigeon 3 hours ago
  > disabling optimizations with /Od
  And that weird flag is because it's a windows compiler: cl.exe, not gcc.
- encom 2 hours ago
  I'm curious what GCC would do if it wasn't purposely lobotomised and fed 300 GB of this nuclear waste.
  [-]
  - encom 1 hour ago
    Well I created the 16 bit .c file, because I'm not that curious. gcc -O0 completed immediately and made a 1,5MB executable. -O1 took about 10 minutes for a 1,8 MB executable. -O2 has been running for 1h15m so far... i7-14700K
    I'm in too deep now, so I'll let it run while I'm at work.
tomaskafka 2 hours ago
Oh, I have an idea for better leftpad implementation, let me publish that to npm real quick!
ks2048 1 hour ago
Silly. Don't waste your time on problems other people have already solved! Use JS and "npm install odd_or_even".
shevy-java 2 hours ago
Damn it - my code got leaked!
wvbdmp 4 hours ago
Next put them in a tree for faster lookups.
[-]
- wiz21c 4 hours ago
  With a tree you'll be limited by the RAM. I advise to use a database.
  [-]
  - actionfromafar 3 hours ago
    Map reduce, cluster geo-failover and CDN caching for optimized coldstarts in case you have to bring it up from scratch in a new datacenter. Bio for contact info, hourly billing. I have helped many startups reach their first 100MARR. Buy my audiobook.
- d--b 3 hours ago
  Don't reinvent the wheel, put them in a SQLite table, and let the sql engine create the tree for you!
tigranbs 4 hours ago
This reminds me of my personal "prime number" grabber research https://github.com/tigranbs/prime-numbers I needed to create the unique graph nodes and assign prime numbers, and to make things efficient, I thought, why not just download the list of known prime numbers instead of generating them one by one. So I did and compiled everything with a single Go binary. Ehh, good old days with a nice feith in making "the best" crappy software out there.
DeathArrow 1 hour ago
I would also like to praise the visionary genius Ross van der Gussom, without whom this wonderful achievement in software engineering would not have been possible!
[-]
- klaff 1 hour ago
  Is he the one married to the singer, Adele Dazeem?
nmilo 1 hour ago
I see why now npm's is-odd has millions of downloads
tornadofart 2 hours ago
"The executable is around 2 MB"- Every dotnet programmer: "Those are rookie numbers!"

d-lisp 4 hours ago

if(n&1)

else

[-]

imtringued 4 hours ago

You can do it even faster with the if statements:

    #include <stdio.h>
    #include <stdlib.h>

    int main(int argc, char *argv[])
    {
        if (argc < 2) {
            fprintf(stderr, "Usage: %s <string>\n", argv[0]);
            return 1;
        }

        char *s = argv[1];
        int i;

        /* find the end of the string */
        for (i = 0; s[i] != '\0'; ++i)
            ;

        /* make sure the string wasn't empty */
        if (i == 0) {
            fprintf(stderr, "Error: empty string\n");
            return 1;
        }

        /* last character is at s[i - 1] */
        char d = s[i - 1];

        if (d == '0')
            printf("even\n");
        if (d == '1')
            printf("odd\n");
        if (d == '2')
            printf("even\n");
        if (d == '3')
            printf("odd\n");
        if (d == '4')
            printf("even\n");
        if (d == '5')
            printf("odd\n");
        if (d == '6')
            printf("even\n");
        if (d == '7')
            printf("odd\n");
        if (d == '8')
            printf("even\n");
        if (d == '9')
            printf("odd\n");
    
        return 0;
    }

gcc -std=c11 -Wall -Wextra -O2 -o check_digit check_digit.c

./check_digit 999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999

[-]

d-lisp 1 hour ago

You inspired me this joyful rewrite:

    #define _(e) { e;};
    #define r(e) _(return e)
    #define I(b, e) _(if (b) r(e));
    #define W(e) _(while (1) _(e));
    int main(int c, char **v) {
      _(I(c != 2, -1) _(c = 0) W(I(!v[1][c++], v[1][c - 2] & 1)))
    }

Tistron 4 hours ago
You can do it even even faster by replacing your if statements (works because the ascii values end in the digit they represent):
```
    if (d & 1)
       printf("odd\n");
    else 
       printf("even\n")
```

tetris11 3 hours ago

probably easier in bash:

    number="$1"
    if [[ "$number" =~ "^(2|4|6|8|10|12|14|16|18|20)$" ]]; then
        echo even
    elif [[ "$number" =~ "^(1|3|5|7|9|11|13|15|17|19)$" ]]; then
        echo odd
    else
        echo Nan
    fi

A bit limited, but you can scale it up

wiz21c 4 hours ago
I'm disappointed, it's not in rust. :-)
[-]
- actionfromafar 3 hours ago
  Still hoping for the C++ template version. Don’t pay for what you don’t use!

ajsnigrutin 2 hours ago
Why not optimize this? Create a lookup table, a 2^64 large array of bools, and just check the n-th element to see if it's odd or even?
Many gigabytes saved!
/s