jtbaker · 2026-01-18T19:12:20 1768763540

You didn't need to rewrite it in C# to do that. Python should be able to handle streaming that amount/velocity of data fine, at least through a native extension like msgspec or pydantic. Additionally, you made it much harder for other data engineers who need to maintain/extend the project in the future to do so.

saberience · 2026-01-18T20:43:36 1768769016

The C# is probably far more maintainable and less error prone than Python. At least in my experience that's almost always the case.

I've had plenty of Python jobs that run fine for several hours and then break with runtime errors, whereas with C# you can be reliably sure that if it starts running it will finish running.

jtbaker · 2026-01-18T21:34:18 1768772058

Not a language problem, it's a dev culture problem. You can hold your devs accountable for the quality of their code. Stronger typing support via static analysis, as well as runtime validation of untrusted input/data, has really helped Python a lot.
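As a rough illustration of the runtime-validation point, here is a minimal standard-library sketch. It is a stand-in for what pydantic or msgspec would do, not their API; the `Reading` record, its fields, and `parse_lines` are hypothetical names invented for the example. Bad records are collected instead of crashing the job hours into a run.

```python
import json
from dataclasses import dataclass


@dataclass(frozen=True)
class Reading:
    """Hypothetical record type; the fields are assumptions for illustration."""
    sensor_id: str
    value: float

    def __post_init__(self):
        # Runtime checks for data a static type checker never sees (parsed JSON).
        if not isinstance(self.sensor_id, str):
            raise TypeError("sensor_id must be a str")
        if isinstance(self.value, bool) or not isinstance(self.value, (int, float)):
            raise TypeError("value must be a number")


def parse_lines(lines):
    """Return (valid readings, rejected lines with the reason they failed)."""
    good, bad = [], []
    for line in lines:
        try:
            good.append(Reading(**json.loads(line)))
        except (ValueError, TypeError) as exc:
            # JSONDecodeError is a ValueError; missing or wrong fields raise TypeError.
            bad.append((line, str(exc)))
    return good, bad


lines = [
    '{"sensor_id": "a1", "value": 3.5}',
    '{"sensor_id": "a2", "value": "oops"}',
    'not json',
    '{"sensor_id": "a3"}',
]
good, bad = parse_lines(lines)
print(len(good), "valid,", len(bad), "rejected")
```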

I'm not necessarily the biggest fan of python, but writing a data engineering tool in a non-data engineering focused language seems like a bad decision. Now when the OP leaves the organization is in a much tougher position.

Rohansi · 2026-01-19T17:06:31 1768842391

> Now when the OP leaves the organization is in a much tougher position.

Are they really, though? You're assuming their org is unfamiliar with C#. Not all data engineers only know Python. The ones I work with mainly use C# because we all do!

jtbaker · 2026-01-19T21:53:12 1768859592

I'm a software and data engineer. I work with C# pretty extensively in my software day job. I've never seen a data engineer job listing mention C#.

Additionally, the way the OP's comment reads, I'm ok with the assumption I made. It reads like it was a unilateral decision on their part and not something that got buy in from the team.

jtbaker · 2026-01-16T23:21:18 1768605678

I'm glad they were able to pivot into Astro when Vite won the hot dev server game a few years back.

jtbaker · 2026-01-16T23:19:04 1768605544

I'd love to see D1 as a supported catalog for https://ducklake.select/

jtbaker · 2026-01-16T17:20:30 1768584030

Something different than https://duckdb.org/docs/stable/clients/java?

clumsysmurf · 2026-01-17T00:38:42 1768610322

Android doesn't use JDBC.

jtbaker · 2026-01-16T15:17:34 1768576654

> Nextjs has no support

From what I remember, you can't even run a NextJS app through vite?

mpeg · 2026-01-16T15:27:08 1768577228

Yes, that's part of the problem, deploying nextjs to cloudflare in the first place used to be an absolute nightmare, let alone the dev experience (I think it's better now)

sp4cec0wb0y · 2026-01-16T16:42:44 1768581764

Wasn't this a decision made by Vercel to incentivize people using Vercel for NextJS apps? I can't recall.

jamieatlason · 2026-01-16T17:34:30 1768584870

It's gotten a lot better since last year with OpenNext. Last I tested was Next.js 15 though. Who knows what Vercel has broken with Next.js 16.

https://opennext.js.org/cloudflare

Vinnl · 2026-01-16T19:08:57 1768590537

That doesn't sound too preposterous; I wouldn't assume you'd be able to run a React Router project on Turbopack or Webpack either, and Next.js I think has a way more intricate dependence on the bundler to power a significant chunk of its features.

mattgreenrocks · 2026-01-16T17:20:12 1768584012

This is insane to me, and validates my irrational dislike of next.

hungryhobbit · 2026-01-16T17:31:27 1768584687

Definitely irrational. There are lots of logical reasons to dislike Next (like the fact that they pile new shiny bit on top of new shiny bit without caring about the regular user experience) ... but being mad that it can't run on Vite is silly.

It's like being mad that Rails can't run on Python, or that React can't run on jQuery. Next already has its own build system, so of course it doesn't work with another build system.

mattgreenrocks · 2026-01-16T18:02:14 1768586534

Isn’t the next.js build system known for being slow/memory hungry?

mzronek · 2026-01-16T21:44:08 1768599848

Luckily DX is much better now with Turbopack as a bundler. First they improved the dev server, now with Turbo builds the production builds are faster as well. Still not fully stable in my opinion, but they will get there.

It's also wise to use monorepo orchestration with build caching like Turborepo.

They did well on the turbo stuff, no doubt about it.

The main bottleneck with big projects in my experience is Typescript. Looking forward to the Go rewrite. :)

pjmlp · 2026-01-16T21:23:16 1768598596

For those stuck in the past yes, they have replaced it with a Rust based toolchain, as is so fashionable nowadays.

jtbaker · 2026-01-16T17:26:25 1768584385

100% rational. Nuxt/Astro FTW.

jtbaker · 2026-01-16T06:10:10 1768543810

Isn't LNG a byproduct of the fracking process - and natural gas has taken over a good chunk of coal's role in our electricity generation?

defrost · 2026-01-16T06:47:17 1768546037

Extracting LNG may or may not be via fracking, and may come from conventional or unconventional fields.

The largest LNG gas fields currently producing are not being "fracked", eg:

https://en.wikipedia.org/wiki/South_Pars/North_Dome_Gas-Cond...

jtbaker · 2026-01-14T22:41:48 1768430508

I think of postal code as a generic, international form of the concept, not tied to a location.

jtbaker · 2026-01-14T14:57:03 1768402623

I feel like with custom vector based styles, you should be able to get pretty dang close to cloning the look of it? Also subjectively, I find the protomaps basemap themes to be much nicer.

jmuguy · 2026-01-15T02:52:14 1768445534

Yeah I agree, I found dozens of options that look (subjectively!) a hell of a lot better.

jtbaker · 2026-01-05T15:00:12 1767625212

Insert obnoxious tailwind comment

jtbaker · 2026-01-08T04:48:10 1767847690

I feel bad about this comment as of today's news. I love tailwind and feel like it has supercharged my ability to be productive with CSS, but recognize that it can be overprescribed.

jtbaker · 2025-12-31T06:56:14 1767164174

Look into using duckdb with remote http/s3 parquet files. The parquet files are organized as columnar vectors, grouped into chunks of rows. Each row group stores metadata about the set it contains that can be used to prune out data that doesn’t need to be scanned by the query engine. https://duckdb.org/docs/stable/guides/performance/indexing
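To make the row-group pruning idea concrete, here is a toy sketch in plain Python. It is not DuckDB's or Parquet's actual implementation; `RowGroup`, `build_row_groups`, and `scan_greater_than` are hypothetical names, and the only statistics modeled are a per-group min and max for one integer column.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RowGroup:
    values: tuple   # column values stored in this chunk of rows
    min_value: int  # statistics kept alongside the chunk as metadata
    max_value: int


def build_row_groups(column, group_size):
    """Split a column into fixed-size chunks and record min/max for each."""
    groups = []
    for i in range(0, len(column), group_size):
        chunk = tuple(column[i:i + group_size])
        groups.append(RowGroup(chunk, min(chunk), max(chunk)))
    return groups


def scan_greater_than(groups, threshold):
    """Return matching values and how many row groups were actually read."""
    matches, groups_read = [], 0
    for group in groups:
        if group.max_value <= threshold:
            continue  # metadata proves no row can match, so skip the data
        groups_read += 1
        matches.extend(v for v in group.values if v > threshold)
    return matches, groups_read


column = list(range(1000))              # sorted data prunes best
groups = build_row_groups(column, 100)  # 10 row groups of 100 rows
matches, groups_read = scan_greater_than(groups, 849)
print(len(matches), "matches from", groups_read, "of", len(groups), "row groups")
```

With sorted input, only the row groups whose range overlaps the filter are touched; on shuffled data the min/max ranges overlap and far fewer groups can be skipped.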

LanceDB has a similar mechanism for operating on remote vector embeddings/text search.

It’s a fun time to be a dev in this space!

nextaccountic · 2026-01-02T01:44:45 1767318285

> Look into using duckdb with remote http/s3 parquet files. The parquet files are organized as columnar vectors, grouped into chunks of rows. Each row group stores metadata about the set it contains that can be used to prune out data that doesn’t need to be scanned by the query engine. https://duckdb.org/docs/stable/guides/performance/indexing

But, when using this on frontend, are portions of files fetched specifically with http range requests? I tried to search for it but couldn't find details

jtbaker · 2026-01-04T03:37:58 1767497878

Yes, you should be able to see the byte range requests and 206 responses from an s3 compatible bucket or http server that supports those access patterns.
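If you want to see what that exchange looks like without a real bucket, here is a small standard-library sketch. The local server, the `PAYLOAD` bytes, and the `/data.parquet` path are all stand-ins I made up; the handler only understands simple `bytes=start-end` ranges, which is enough to show the `Range` request header and the 206 response.

```python
import http.client
import threading
from http.server import BaseHTTPRequestHandler, HTTPServer

# Hypothetical stand-in for a remote Parquet file: 1 KiB of known bytes.
PAYLOAD = bytes(range(256)) * 4


class RangeHandler(BaseHTTPRequestHandler):
    """Minimal handler that honours a single 'bytes=start-end' range."""

    def do_GET(self):
        header = self.headers.get("Range")
        if header and header.startswith("bytes="):
            start_s, end_s = header[len("bytes="):].split("-", 1)
            start = int(start_s)
            end = min(int(end_s) if end_s else len(PAYLOAD) - 1, len(PAYLOAD) - 1)
            chunk = PAYLOAD[start:end + 1]
            self.send_response(206)  # Partial Content
            self.send_header("Content-Range", f"bytes {start}-{end}/{len(PAYLOAD)}")
        else:
            chunk = PAYLOAD
            self.send_response(200)
        self.send_header("Accept-Ranges", "bytes")
        self.send_header("Content-Length", str(len(chunk)))
        self.end_headers()
        self.wfile.write(chunk)

    def log_message(self, *args):  # keep the demo quiet
        pass


def fetch_range(port, start, end):
    """Request bytes start..end (inclusive) and return status, header, body."""
    conn = http.client.HTTPConnection("127.0.0.1", port, timeout=5)
    try:
        conn.request("GET", "/data.parquet", headers={"Range": f"bytes={start}-{end}"})
        resp = conn.getresponse()
        return resp.status, resp.getheader("Content-Range"), resp.read()
    finally:
        conn.close()


server = HTTPServer(("127.0.0.1", 0), RangeHandler)  # port 0 picks a free port
threading.Thread(target=server.serve_forever, daemon=True).start()

status, content_range, body = fetch_range(server.server_port, 100, 199)
print(status, content_range, len(body))

server.shutdown()
server.server_close()
```

Against a real S3-compatible bucket you would look for the same `Range` request header and 206 status in the browser's network tab.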