I think the problem is actually the opposite. There aren't enough abstractions t...

avip · on Oct 10, 2020

As a programmer I’m mostly measured on how fast and not buggy (in that order) I can ship. No one is profiling my code or asking O(wtf) it takes.

So from crap import * is pragmatic and practical. Short term at least.

saagarjha · on Oct 10, 2020

> No one is profiling my code or asking O(wtf) it takes.

That you know of.

eklavya · on Oct 10, 2020

But the compiler is going to shake the tree and not include anything not being used right?

tylerhou · on Oct 10, 2020

Not sure if this is sarcastic, but...

1) In dynamic languages, it's not possible to detect whether a function is used or not in the general case. For example, consider string accesses on objects. If the compiler is not sophisticated enough to resolve the set of possible strings at compile time (or such analysis would unacceptably increase compile times), then you can't shake out unused methods on that object. [1]

2) For languages like C and C++, the compiler cannot tree shake because it only knows about a single file at a time (translation unit, to be precise). You would have to rely on link-time optimizations to effectively tree shake, but LTO is not well-supported by all toolchains.

Tree shaking also has a cost that I mentioned earlier -- it increases compile times. Both LTO and tree shaking in dynamic languages increase compile times superlinearly [citation needed] wrt. the size of your application. As other commenters have mentioned, it's better to avoid including unnecessary libraries in the first place.

There are no cost free abstractions, and that applies to tree shaking as well. https://www.youtube.com/watch?v=rHIkrotSwcc

[1] For the pedants: yes, I know resolving the possible set of values (stricter than "all possible values for that type") for a variable is undecidable in the general case.

amelius · on Oct 10, 2020

You could perhaps better use a lazy-loading strategy instead of a static analysis. (But this would change the semantics in case of an existing language that allows side-effects while loading modules, unless you have a lazy strategy for them too ...; and then there are the errors you'd have to deal with)

tylerhou · on Oct 10, 2020

To truly achieve the same thing as "tree shaking," the function call overhead would be abysmal. You'd have to check whether the module was loaded already (with synchronization if your program is multithreaded). For single threaded programs, you could avoid this by hot-patching your machine code, but there's no way around some synchronization barrier in multithreaded programs [1]. In JavaScript (or any language where you want to avoid sending a large bundle over the network), you'd incur the latency of a network request for the first call to any function.

You're right that people are already splitting apps into bundles, but that is usually done at the page level.

[1] You could probably avoid having to take a mutex after the first call to the function with self patching code, but that sounds incredibly ugly and could have other implications (self-modifying code could be detected as a virus; could be used as a gadget to exploit some other vulnerability).

amelius · on Oct 10, 2020

Isn't self-modifying code already the norm, with JIT compilers?

tylerhou · on Oct 10, 2020

I am under the impression that JIT compilers do not modify the compiler’s own bytecode. They write generated code into a separate data region and mark that region as executable. If the code needs to modified, then control transfers back to the compiler which can mark the region as writable again.

The same can be done for a binary and it’s own code, but I wonder if it’s used as a signal in antivirus protections if done too often.

https://en.m.wikipedia.org/wiki/W%5EX

mcguire · on Oct 10, 2020

... which is why JIT compiled code is slower in some circumstances.

cbsmith · on Oct 10, 2020

The ol' sufficiently smart compiler myth. It's a classic.

aszen · on Oct 10, 2020

See the Elm to JS compiler, it does deep unused code elimination at the function level so you only ship the code you actually use (or the functions u actually use, even inside libraries).

The closure js compiler is also pretty good if you prefer writing js.

If we only start using better languages in our applications we could improve performance quite a bit.

cbsmith · on Oct 11, 2020

Dead code elimination has been available in C compilers since... so long ago I can't remember. It's not that compilers can't do it, but much like the halting problem, they really can't catch it all.

jamil7 · on Oct 10, 2020

I can’t tell if this is sarcastic or not. Tree shaking isn’t a silver bullet and really only works under specific circumstances. Better to not include the library to begin with.

kwhitefoot · on Oct 10, 2020

Just program in Fortran IV then the linker will only include routines that are called.

selecsosi · on Oct 10, 2020

> There aren't enough good abstractions