169

Zen-C: Write like a high-level language, run like C

From what I can see in the codegen, defer is not implemented "properly": the deferred statements are only executed when the block exits normally; leaving the block via "return", "break", "continue" (including their labelled variants! those interact subtly with outer defers), or "goto" skips them entirely. Which, arguably, should not happen:

    var f = fopen("file.txt", "r");
    defer fclose(f);

    if fread(&ch, 1, 1, f) <= 0 { return -1; }
    return 0;
would not close file if it was empty. In fact, I am not sure how it works even for normal "return 0": it looks like the deferred statements are emitted after the "return", textually, so they only properly work in void-returning function and internal blocks.
10 hours agoJoker_vD

Did you manage to compile this example?

9 hours agoricardobeat

Yes, actually:

    $ cat kekw.zc
    include <stdio.h>
    
    fn main() {
        var f = fopen("file.txt", "r");
        defer fclose(f);
    
        var ch: byte;
        if fread(&ch, 1, 1, f) <= 0 { return -1; }
        return 0;
    }
    $ ./zc --emit-c kekw.zc
    [zc] Compiling kekw.zc...
    $ tail -n 12 out.c
    int main()
    {
        {
        __auto_type f = fopen("file.txt", "r");
        uint8_t ch;
    if ((fread((&ch), 1, 1, f) <= 0))     {
        return (-1);
        }
        return 0;
    fclose(f);
        }
    }
8 hours agoJoker_vD

> Mutability

> By default, variables are mutable. You can enable Immutable by Default mode using a directive.

> //> immutable-by-default

> var x = 10; > // x = 20; // Error: x is immutable

> var mut y = 10; > y = 20; // OK

Wait, but this means that if I’m reading somebody’s code, I won’t know if variables are mutable or not unless I read the whole file looking for such directive. Imagine if someone even defined custom directives, that doesn’t make it readable.

12 hours agoLucasoato

Given an option that is configurable, why would the default setting be the one that increases probability of errors?

For some niches the answer is "because the convenience is worth it" (e.g. game jams). But I personally think the error prone option should be opt in for such cases.

Or to be blunt: correctness should not be opt-in. It should be opt-out.

I have considered such a flag for my future language, which I named #explode-randomly-at-runtime ;)

12 hours agoandai

But why put it as a global metaswitcher instead of having different type infered from initial assignation qualifier?

Example:

    let integer answer be 42 — this is a constant
    set integer temperature be 37.2 — this is a mutable

Or with the more esoglyphomaniac fashion

    cst ↦ 123 // a constant is just a trivial map
    st ← 29.5 // initial assignment inferring float
8 hours agopsychoslave

> Or to be blunt: correctness should not be opt-in. It should be opt-out.

One can perfectly fine write correct programs using mutable variables. It's not a security feature, it's a design decision.

That being said, I agree with you that the author should decide if Zen-C should be either mutable or immutable by default, with special syntax for the other case. As it is now, it's confusing when reading code.

10 hours agomiroljub

> Given an option that is configurable, why would the default setting be the one that increases probability of errors?

They're objecting to the "given", though. They didn't comment either way on what the default should be.

Why should it be configurable? Who benefits from that? If it's to make it so people don't have to type "var mut" then replace that with something shorter!

(Also neither one is more 'correct')

6 hours agoDylan16807

It's not ideal but it seems like something an LSP could tell you on a hover event. I didn't see an LSP (I didn't look that hard either) but presumably that's within the scope of their mission statement to deliver modern language ergonomics. (But I agree with sibling comments that this should be a keyword. Another decent alternative would be that it's only global in scope.)

5 hours agomaxbond

Other languages also have non-local qays of influencing compiler behavior, for example attributes in rust (standard) or compiler pragmas in C (non-standard).

When reading working code, it doesn't matter whether the language mode allows variable reassignment. It only matters when you want to change it. And even then, the compiler will yell at you when you do the wrong thing. Testing it out is probably much faster than searching the codebase for a directive. It doesn't seem like a big deal to me.

9 hours agogmueckl

Yeah, immutability should probably use a `let` keyword and compiler analysis should enforce value semantics on those declarations.

11 hours agonetbioserror

Agreed, using `var` keyword for something that is non-var-ying (aka immutable) is not very intuitive.

7 hours agophcreery

Mutability is distinct from variability. In Javascript only because it's a pretty widely known syntax:

    const f = (x) => {
      const y = x + 1;
      return y + 1;
    }
y is an immutable variable. In f(3), y is 4, and in f(7), y is 8.

I've only glanced at this Zen-C thing but I presume it's the same story.

2 hours agoekipan

"immutable variable" is an oxymoron. Just because Javascript did it does not mean every new language has to do it the same way.

36 minutes agoDeanoumean

It's odd that the async/await syntax _exclusively_ uses threads under the hood. I guess it makes for a straightforward implementation, but in every language I've seen the point of async/await is to use an event loop/cooperative multitasking.

11 hours agoseabrookmx

Noob question: if it just compiles to threads, is there any need for special syntax in the first place? My understanding was that no language support should be required for blocking on a thread.

9 hours agosestep

One advantage is that it gives you the opportunity to move to a more sophisticated implementation later without breaking backwards compatibility (assuming the abstraction does not leak).

3 hours agomaxbond

Async/await should do a little more under the hood than what the typical OS threading APIs provide, for example forwarding function parameters and return values automatically instead of making the user write their own boilerplate structs for that.

9 hours agogmueckl

Syntax aside, how does this compare to Nim? Nim does similar, I think Crystal does as well? Not entirely sure about Crystal tbh. I guess Nim and Vala, since I believe both transpile to C, so you really get "like C" output from both.

13 hours agogiancarlostoro

From what I see, Zen-C aims to be "C with super-powers". It still uses C pointers for arrays and strings. It transpiles to single human-readable C file without symbol mangling. No safety. Not portable (yet?).

Nim is a full, independent modern language that uses C as one of its backends. It has its own runtime, optional GC, Unicode strings, bounds checking, and a huge stdlib. You write high-level Nim code and it spits out optimized C you usually don't touch.

Here’s a little comparison I put together from what I can find in the readme and code:

    Comparison          ZenC           Nim
    
    written in          C              Self-Hosted
    targets             C              C, C++, ObjC, JS, LLVM (via nlvm), Native (in-progress)
    platforms           POSIX          Linux, Windows, MacOS, POSIX, baremetal
    mm strategy         manual/RAII    ARC, ORC(ARC with cycle collector), multiple gc, manual
    generated code      human-readable optimized
    mangling            no             yes
    
    stdlib              bare           extensive/batteries-included
    
    compile-time code   yes            yes
    macros              comptime?      AST manipulation
    
    arrays              C arrays       type and size is retained at all times
    strings             C strings      have capacity and length, support Unicode
    bounds-checking     no             yes (optional)
an hour agoarchargelod

Crystal compiles directly to object code, using LLVM. It does provide the ability to interoperate with C code; as an example, I use this feature to call ncursesw functions from Crystal.

11 hours agobloovis

man I haven't heard anything about Vala in ages. is it still actively developed/used? how is it?

11 hours agojdc0589

Yes, it is actively being developed.

Quite easy to make apps with it and GNOME Builder makes it really easy to package it for distribution (creates a proper flatpak environment, no need to make all the boilerplate). It's quite nice to work with, and make stuff happen. Gtk docs and awful deprecation culture (deprecate functions without any real alternative) are still a PITA though.

10 hours agospookie

Vala is still being developed and used in the GNOME ecosystem. Boo, on the other hand, is pretty dead.

4 hours agodeaddodo

I was also going to mention this reminds me of Vala, which I haven't seen or heard from in 10+ years.

11 hours agolgunsch

Surprisingly theres a shocking number of GUI programs for Linux made with Vala, and ElementaryOS is built using Vala, and all their custom software uses Vala. So it's not dead, just a little known interesting language. :)

7 hours agogiancarlostoro

An interesting bit to me is that it compiles to (apparently) readable C, I'm not sure how one would use that to their advantage

I am not too familiar with C - is the idea that it's easier to incrementally have some parts of your codebase in this language, with other parts being in regular C?

13 hours agomorcus

one benefit is that a lot of tooling e.g. for verification etc. is built around C.

another is that it only has C runtime requirement, so no weird runtime stuff to impelement if youd say want to run on bare metal..you could output the C code and compile it to your target.

12 hours agosaidnooneever

i think so. The biggest hurdle with new languages is that you are cut off from a 3rdparty library ecosystem. Being compatible with C 3rd party libraries is a big win.

13 hours agozbendefy

Makes it easy to "try before you buy", too. If you decide it's not for you, you can "step out" and keep the generated C code and go from there.

12 hours agoactionfromafar

This isn't a very sane plan. The ~300 LOC example mini_grep (https://github.com/z-libs/Zen-C/blob/main/examples/tools/min...) compiles to a ~3.3k LOC monstrosity (https://pastebin.com/raw/6FBSpt1z). It's easier to rewrite the whole thing than going from the generated code.

At least for now, generated code shouldn't be considered something you're ever supposed to interact with.

11 hours agoforgotpwd16

I looked at it primed for the worst by your comment, but it’s honestly not so bad. A lot of setup, data type code and what looks like overloads.

4 hours agoactionfromafar

Very good point that I never considered! Thanks.

11 hours agomorcus

Initial commit was 24h ago, 363 stars, 20 forks already. Man, this goes fast.

13 hours agoGys

man has been posting a lot before the initial commit about his library. following the guy on linkedin.

12 hours agosaidnooneever

Could be bots.

13 hours agoworldsavior

It’s not, it’s just how hackernews works. You’ll see new projects hit 1k-10k stars in a matter of a day. You can have the best project, best article to you but if everyone else doesn’t think so it’ll always be at the bottom. Some luck involved too. Bots upvoting a post not organically I doubt is gonna live long on first page.

13 hours agoalexpadula

The stars are on GitHub, they can come from somewhere else, e.g. the author himself buying stars.

12 hours agoworldsavior

This is hella common. Companies have too much money to spend.

11 hours agoalexpadula

Hi, I'm the developer's father. Trust me, he hasn't bought a single star in his life—not even in Super Mario :p

7 hours agoDuendek86

Definitely could be, but the dev has been posting updates on Twitter for a while now. It could be just some amount of hype they have built.

13 hours agodirectmusic
[deleted]
13 hours ago

Basically C2/C3 but Rust influenced. Missed chance to call it C4.

12 hours agoforgotpwd16

So, the point of this language is to be able to write code with high productivity, but with the benefit of compiling it to a low level language? Overall it seems like the language repeats what ZIG does, including the C ABI support, manual memory management with additional ergonomics, comptime feature. The biggest difference that comes to mind quickly is that the creator of Zen-C states that it can allow for the productivity of a high level language.

13 hours agov_iter

It has stringly typed macros. It's not comparable to Zig's comptime, even if it calls it comptime:

    fn main() {
        comptime {
            var N = 20;
            var fib: long[20];
            fib[0] = (long)0;
            fib[1] = (long)1;
            for var i=2; i<N; i+=1 {
                fib[i] = fib[i-1] + fib[i-2];
            }

            printf("// Generated Fibonacci Sequence\n");
            printf("var fibs: int[%d] = [", N);
            for var i=0; i<N; i+=1 {
                printf("%ld", fib[i]);
                if (i < N-1) printf(", ");
            }
            printf("];\n");
        }

        print "Compile-time generated Fibonacci sequence:\n";
        for i in 0..20 {
            print f"fib[{i}] = {fibs[i]}\n";
        }
    }
It just literally outputs characters, not even tokens like rust's macros, into the compiler's view of the current source file. It has no access to type information, as Zig's does, and can't really be used for any sort of reflection as far as I can tell.

The Zig equivalent of the above comptime block just be:

    const fibs = comptime blk: {
        var f: [20]u64 = undefined;
        f[0] = 0;
        f[1] = 1;
        for (2..f.len) |i| {
            f[i] = f[i-1] + f[i-2];
        }
        break :blk f; 
    };
Notice that there's no code generation step, the value is passed seamlessly from compile time to runtime code.
12 hours agomesse

I wonder, how can a programming language have the productivity of a high-level language ("write like a high-level language"), if it has manual memory management? This just doesn't add up in my view.

I'm writing my own programming language that tries "Write like a high-level language, run like C.", but it does not have manual memory management. It has reference counting with lightweight borrowing for performance sensitive parts: https://github.com/thomasmueller/bau-lang

12 hours agothomasmg

C is literally a high level language.

19 minutes agojhgb

Nim is a high-level language as well and compiles to C.

13 hours agojohnisgood

Odin and Jai are others.

12 hours agothe__alchemist

Does Odin compile to C? I thought it only uses LLVM as a backend

12 hours agoVoycawojka

No, Odin does not compile to C. It is a standalone programming language that compiles directly to machine code. It primarily uses LLVM as its backend for compiling to machine code, like you said.

8 hours agojohnisgood

Same question but for Jai.

9 hours agosestep
[deleted]
7 hours ago

chicken scheme compiles to c as well. it's a pretty convenient compilation target, you get to use all the compilers and tool chains out there and you don't add a dependency on llvm

10 hours agozem

I love CHICKEN Scheme! Nice to see it mentioned. Though I think it's worth pointing out it compiles to something pretty far from handwritten C, to my understanding. I think this is true of both performance and semantics; for example you can return a pointer to a stack allocated struct from a foreign lambda (this is because chicken's generated C code here doesn't really "return", I think. Not an expert).

Of course you can always drop to manually written C yourself and it's still a fantastic language to interop with C. And CHICKEN 6 (still pre-release) improves upon that! E.g structs and Unions can be returned/passed directly by/to foreign functions, and the new CRUNCH extension/subset is supposed to compile to something quite a bit closer to handwritten C; there are even people experimenting with it on embedded devices.

9 hours agodieggsy

Chicken indeed interoperates with C quite easily and productively. You're right that the generated C code is mostly incomprehensible to humans, but compiles without difficulty.

The Chicken C API has functions/macros that return values and those that don't return. The former include the fabulous embedded API (crunch is an altogether different beast) which I've used in "mixed language" programming to good effect. In such cases Scheme is rather like the essential "glue" that enables the parts written in other languages to work as a whole.

Of course becoming proficient in Scheme programming takes time and effort. I believe it's true that some brains have an affinity for Lispy languages while others don't. Fortunately, there are many ways to write programs to accomplish a given task.

7 hours agojrapdx3

> this is because chicken's generated C code here doesn't really "return", I think. Not an expert.

not an expert either, but you're right about that, it uses cps transformations so that functions never return. there's a nice write up here: https://wiki.call-cc.org/chicken-compilation-process#a-guide...

8 hours agozem

I am working on mine as well. I think it is very sane to have some activity in this field. I hope we will have high level easy to write code that is fully optimized with very little effort.

12 hours agokuon

There are going to be lots of languages competing with Rust and Zig. It's a popular, underserved market. They'll all have their unique angle.

12 hours agoechelon

I has been served for several decades, however since the late-90's many decided reducing to only C and C++ was the way going forward, now the world is rediscovering it doesn't have to be like that.

12 hours agopjmlp

They're are certainly going to be lots of languages because now with LLMs it's easier (trivial?) to make one + library (case in point: just within last month there're have been posted here ~20 new langs with codebases 20k~100k LOC) but don't really see them competing. Rust and Zig brought actual improvements and are basically replacing usecases that C++/C had limiting the space available to others.

11 hours agoforgotpwd16

Nice! Compiles in 2s on my unexceptional hardware. But it lacks my other main desiderata in a new language: string interpolation and kebab-case.

8 hours agostevefolta

Oh, it _does_ have string interpolation, my bad. Sadly, not by default -- you still have to go back and add an "f" before the string once you've started typing it and then realize that you want an interpolated string. Also, it doesn't always work -- if I define two interpolated string variables in one function, GCC chokes in a way I'm not understanding. And every interpolated string variable consumes 4K of global memory.

an hour agostevefolta
[deleted]
13 hours ago

I wonder how this compares to the Beef programming language.

https://www.beeflang.org/

The Beef programming language was used to write Penny's Big Breakaway.

6 hours agoDwedit

That's a very nice project.

List of remarks:

> var ints: int[5] = {1, 2, 3, 4, 5};

> var zeros: [int; 5]; // Zero-initialized

The zero initialized array is not intuitive IMO.

> // Bitfields

If it's deterministically packed.

> Tagged unions

Same, is the memory layout deterministic (and optimized)?

> 2 | 3 => print("Two or Three")

Any reason not to use "2 || 3"?

> Traits

What if I want to remove or override the "trait Drawing for Circle" because the original implementation doesn't fit my constraints? As long as traits are not required to be in a totally different module than the struct I will likely never welcome them in a programming language.

13 hours agokreco

C uses `|` for bitwise OR and `||` for logical OR. I'm assuming this inherited the same operator paradigm since it compiles to C.

6 hours agojdmichal

The author includes some easter-eggs (printing random facts about Zen and various C constructs) which trigger randomly -- check out the file src/zen/zen_facts.c in the repository...

7 hours agotaolson

Is this the Typescript of C ?

13 hours agoalfonsodev

The tagline also applies to C :-)

5 hours agoZambyte

Answering the title, why not Julia?

4 hours agoArchit3ch

Example at the top of the readme!

5 hours agoaappleby

What's the performance hit?

12 hours agoblacksqr

18 commits! I hope you keep up with the project, it’s really cool, great work.

13 hours agoalexpadula

The whole language examples seem pretty rational, and I'm especially pleased / shocked by the `loop / repeat 5` examples. I love the idea of having syntax support for "maximum number of iterations", eg:

    repeat 3 {
       try { curl(...) && break }
       except { continue }
    }
...obviously not trying to start any holy wars around exceptions (which don't seem supported) or exponential backoff (or whatever), but I guess I'm kindof shocked that I haven't seen any other languages support what seems like an obvious syntax feature.

I guess you could easily emulate it with `for x in range(3): ...break`, but `repeat 3: ...break` feels a bit more like that `print("-"*80)` feature but for loops.

12 hours agoramses0

Ruby has a similarly intuitive `3.times do ... end` syntax

12 hours agozeknife

go also has

    for range 5 { ... }
10 hours agoioasuncvinvaer

But at that point why not Rust then?

2 hours agosebastianconcpt

Am I the only one who saw this syntax and immediately though "Man, this looks almost identical to Rust with a few slight variations"?

13 hours agoethin

It seems to just be Rust for people who are allergic to using Rust.

It looks like a fun project, but I'm not sure what this adds to the point where people would actually use it over C or just going to Rust.

13 hours agoCupricTea

> what this adds

I guess the point is what is subtracts, instead - answer being the borrow-checker.

13 hours agonnevatie

> answer being the borrow-checker

There is an entire world in Rust where you never have to touch the borrow-checker or lifetimes at all. You can just clone or move everything, or put everything in an Arc (which is what most other languages are doing anyway). It's very easy to not fight the compiler if you don't want to.

Maybe the real fix for Rust (for people that don't want to care), is just a compiler mode where everything is Arc-by-default?

13 hours agopetcat

So it re-adds manual lifetime checking. Got it.

13 hours agoYgg2

It might or might not be a toy project, I'm not sure, but one advantage of subtracting the borrow checking is that the compiler avoids a lot of complex machinery.

Borrow checking in Rust isn't sound AFAIK, even after all these years, so some of the problems with designing and implementing lifetimes, region checking, and borrow checking algorithms, aren't trivial.

5 hours agoaskyng

Maybe take the parts of rust the author likes, but still encourages pointers in high level operations?

13 hours agothe__alchemist

I thought the same and felt it looked really out of place to have I8 and F32 instead of i8 and f32 when so much else looks just like Rust. Especially when the rest of the types are all lower case.

13 hours agosuioir

Agreed, that really stood out as a ... questionable design decision, and felt extremely un-ergonomic which seems to go against the stated goals of the language.

13 hours agounwind

Every language is apparently required to make one specific version of these totally arbitrary choices, like whether to call the keyword function, func, fun, fn, or def. Once they do, it’s a foolish inconsistency with everything else. What if the language supported every syntax?

12 hours agohyperhello

My immediate thought was it looked a lot like Swift

13 hours agoturbotim
[deleted]
13 hours ago
[deleted]
13 hours ago

Why not compile to rust or assembly? C seems like an odd choice.

In fact why not simply write rust to begin with?

13 hours agoGrowingSideways

Assembly requires way more work than compiling to, say C. Clang and gcc do a lot of the heavy lifting regarding optimisation, spilling values to the stack, etc

13 hours agoxnacly

Then you're stuck with the C stack, though, and no way to collect garbage.

13 hours agoGrowingSideways
[deleted]
8 hours ago

really? you cant track and count your pointers in C? why not?

12 hours agosaidnooneever

I have a couple interpreters I've been poking at and one uses 'musttail' while the other uses a trampoline to get around blowing up the C stack when dispatching the operators. As for the GC, the trampoline VM has a full-blown one (with bells-and-whistles like an arena for short lived intermediate results which get pushed/popped by the compiled instructions) while the other (a peg parser VM) just uses an arena as the 'evaluation' is short lived and the output is the only thing really needing tracking so uses reference counting to be (easily) compatible with the Python C-API. No worries about the C stack at all.

I mean, I could have used the C stack as the VM's stack but then you have to worry about blowing up the stack, not having access (without a bunch of hackery, looking at you scheme people) to the values on the stack for GC and whatnot and, I imagine, all the other things you have issues with but it's not needed at all, just make your own (or, you know, tail call) and pretend the C one doesn't exist.

And I've started on another VM which does the traditional stack thing but it's constrained (by the spec) to have a maximum stack depth so isn't too much trouble.

11 hours agoUncleEntity

If I understand the history correctly then it started as a set of C preprocessor macros.

12 hours agodfox

[flagged]

13 hours agokuttel2

> C is the patrician's choice.

Is this suppose to be a positive thing? I thought we all wanted to violently murder the patricians.

Regardless, C might be a valid IR. I apologize for being bigoted.

13 hours agoGrowingSideways

Try it again and see how it goes.