Python 3.15's JIT is now back on track

(fidget-spinner.github.io)

377 points | by guidoiaquinti 15 hours ago

21 comments

  • mattclarkdotnet 6 hours ago
    Python really needs to take the Typescript approach of "all valid Python4 is valid Python3". And then add value types so we can have int64 etc. And allow object refs to be frozen after instantiation to avoid the indirection tax.

    Sensible type-annotated python code could be so much faster if it didn't have to assume everything could change at any time. Most things don't change, and if they do they change on startup (e.g. ORM bindings).

    • mattclarkdotnet 6 hours ago
      To clarify, it is nuts that in an object method, there is a performance enhancement through caching a member value.

        class SomeClass
          def init(self)
            self.x = 0
          def SomeMethod(self)
            q = self.x
            ## do stuff with q, because otherwise you're dereferencing self.x all the damn time
      • dekelpilli 5 hours ago
        Java also has a performance cost to accessing class fields, as exampled by this (now-replaced) code in the JDK itself - https://github.com/openjdk/jdk/blob/jdk8-b120/jdk/src/share/...
        • anematode 3 hours ago
          Any decent JIT compiler (and HotSpot's is world class) will optimize this out. Likely this was done very early on in development, or was just to reduce bytecode size to promote inlining heuristics that use it
          • LtWorf 2 hours ago
            But what if whatever you call is also accessing and changing the attribute?
            • anematode 2 hours ago
              If what you call gets inlined, then the compiler can see that it either does or doesn't modify the attribute and optimize it accordingly. Even virtual calls can often be inlined via, e.g., class hierarchy analysis and inline caches.

              If these analyses don't apply and the callee could do anything, then of course the compiler can't keep the value hoisted. But a function call has to occur anyway, so the hoisted value will be pushed/popped from the stack and you might as well reload it from the object's field anyway, rather than waste a stack slot.

        • Kamii0909 56 minutes ago
          That was a niche optimization primarily targeting code at intepretor. Even the most basic optimizing compiler in HotSpot tiered compilation chain at that time (the client compiler or C1) would be able to optimize that into the register. Since String is such an important class, even small stuffs like this is done.
      • duskdozer 2 hours ago
        You mean even if x is not a property?
      • mathisfun123 5 hours ago
        > it is nuts that in an object method, there is a performance enhancement through caching a member value

        i don't understand what you think is nuts about this. it's an interpreted language and the word `self` is not special in any way (it's just convention - you can call the first param to a method anything you want). so there's no way for the interpreter/compiler/runtime to know you're accessing a field of the class itself (let alone that that field isn't a computed property or something like that).

        lots of hottakes that people have (like this one) are rooted in just a fundamental misunderstanding of the language and programming languages in general <shrugs>.

        • mattclarkdotnet 4 hours ago
          What's nuts is that the language doesn't guarantee that successive references to the same member value within the same function body are stable. You can look it up once, go off and do something else, and look it up again and it's changed. It's dynamism taken to an unnecessary extreme. Nobody in the real world expects this behaviour. Making it just a bit less dynamic wouldn't change the fundamentals of the language but it would make it a lot more tractable.
          • Someone 3 hours ago
            > What's nuts is that the language doesn't guarantee that successive references to the same member value within the same function body are stable.

            The language supports multiple threads and doesn’t have private fields (https://docs.python.org/3/tutorial/classes.html#private-vari...), so the runtime cannot rule out that the value gets changed in-between.

            And yes, it often is obvious to humans that’s not intended to happen, and almost never what happens, but proving that is often hard or even impossible.

          • fulafel 4 hours ago
            > Nobody in the real world expects this behaviour.

            For example, numbers and strings are immutable objects in Python. If self.x is a number and its numeric value is changed by a method call, self.x will be a different object after that. I'd dare say people expect this to work.

          • codesnik 3 hours ago
            basically all object oriented languages work like that. You access a member; you call a method which changes that member; you expect that change is visible lower in the code, and there're no statically computable guarantees that particular member is not touched in the called method (which is potentially shadowed in a subclass). It's not dynamism, even c++ works the same, it's an inherent tax on OOP. All you can do is try to minimize cost of that additional dereference. I'm not even touching threads here.

            now, functional languages don't have this problem at all.

            • cherryteastain 1 hour ago
              OOP has nothing to do with it. In your C++ example, foo(bar const&); is basically the same as bar.foo();. At the end of the day, whether passing it in as an argument or accessing this via the method call syntax it's just a pointer to a struct. Not to mention, a C++ compiler can, and often does, choose to put even references to member variables in registers and access them that way within the method call.

              This is a Python specific problem caused by everything being boxed by default and the interpreter does not even know what's in the box until it dereferences it, which is a problem that extends to the "self" object. In contrast in C++ the compiler knows everything there's to know about the type of this which avoids the issue.

              • josefx 3 minutes ago
                > This is a Python specific problem caused by everything being boxed

                I would say it is part python being highly dynamic and part C++ being full of undefined behavior.

                A c++ compiler will only optimize member access if it can prove that the member isn't overwritten in the same thread. Compatible pointers, opaque method calls, ... the list of reasons why that optimization can fail is near endless, C even added the restrict keyword because just having write access to two pointers of compatible types can force the compiler to reload values constantly. In python anything is a function call to some unknown code and any function could get access to any variable on the stack (manipulating python stack frames is fun).

                Then there is the fun thing the C++ compiler gets up to with varibles that are modified by different threads, while(!done) turning into while(true) because you didn't tell the compiler that done needs to be threadsafe is always fun.

              • adrian17 11 minutes ago
                That's not true. I mean: it's true that it has little to do with OOP, but most imperative languages (only exception I know is Rust) have the issue, it's not "Python specific". For example (https://godbolt.org/z/aobz9q7Y9):

                struct S { const int x; int f() const; }; int S::f() const { int a = x; printf("hello\n"); int b = x; return a-b; }

                The compiler can't reuse 'x' unless it's able to prove that it definitely couldn't have changed during the `printf()` call - and it's unable to prove it. The member is loaded twice. C++ compilers can usually only prove it for trivial code with completely inlined functions that doesn't mutate any external state, or mutates in a definitely-not-aliasing way (strict aliasing). (and the `const` don't do any difference here at all)

                In Python the difference is that it can basically never prove it at all.

          • rtpg 4 hours ago
            In Python attribute access aren't stable! `self.x` where `x` is a property is not guaranteed to refer to the same thing.

            And getting rid of descriptors would be a _fundamental change to the language_. An immeense one. Loads of features are built off of descriptors or descriptor-like things.

            And what you're complaining about is also not true in Javascript world either... I believe you can build descriptor-like things in JS now as well.

            _But_ if you want that you can use stuff like mypyc + annotations to get that for you. There are tools that let you get to where you want. Just not out of the box because Python isn't that language.

            Remember, this is a scripting language, not a compiled language. Every optimization for things you talk about would be paid on program load (you have pyc stuff but still..)

            Gotta show up with proof that what you're saying is verifiable and works well. Up until ~6 or 7 years ago CPython had a concept of being easy to onboard onto. Dataflow analyses make the codebase harder to deal with.

            Having said all of that.... would be nice to just inline RPython-y code and have it all work nicely. I don't need it on everything and proving safety is probably non-trivial but I feel like we've got to be closer to doing this than in the past.

            I ... think in theory the JIT can solve for that too. In theory

          • mathisfun123 3 hours ago
            > same member value within the same function body are stable

            Did you miss the part where I explained to you there's no way to identify that it's a member variable?

            > Nobody in the real world expects this behaviour

            As has already been explained to you by a sibling comment you are in fact wrong and there are in fact plenty of people in the real world who do actually expect this behavior.

            So I'll repeat myself: lots of hottakes from just pure. Unadulterated, possibly willful, ignorance.

            • coldtea 1 hour ago
              The above is a very thick response that doesn't address the parent's points, just sweeps them under the rag with "that's just how it was designed/it works".

              "Did you miss the part where I explained to you there's no way to identify that it's a member variable?"

              No, you you did miss the case where that in itself can be considered nuts - or at least an unfortunate early decision.

              "this just how things are dunn around diz here parts" is not an argument.

        • EE84M3i 25 minutes ago
          > the word `self` is not special in any way (it's just convention - you can call the first param to a method anything you want).

          The name `self` is a convention, yes, but interestingly in python methods the first parameter is special beyond the standard "bound method" stuff. See for example PEP 367 (New Super) for how `super()` resolution works (TL;DR the super function is a special builtin that generates extra code referencing the first parameter and the lexically defining class)

        • bmitc 4 hours ago
          I don't think it's a hot take to say much of Python's design is nuts. It's a very strange language.
    • stabbles 2 hours ago
      That was how the Mojo language started. And then soon after the hype they said that being a superset of Python was no longer the goal. Probably because being a superset of Python is not a guarantee for performance either.
    • wolvesechoes 1 hour ago
      > Python really needs to take the Typescript approach of "all valid Python4 is valid Python3"

      It is called type hints, and is already there. TS typing doesn't bring any perf benefits over plain JS.

      • stabbles 1 hour ago
        You really need dedicated types for `int64` and something like `final`. Consider:

            class Foo:
              __slots__ = ("a", "b")
              a: int
              b: float
        
        there are multiple issues with Python that prevent optimizations:

        * a user can define subtype `class my_int(int)`, so you cannot optimize the layout of `class Foo`

        * the builtin `int` and `float` are big-int like numbers, so operations on them are branchy and allocating.

        and the fact that Foo is mutable and that `id(foo.a)` has to produce something complicates things further.

        • wolvesechoes 39 minutes ago
          Maybe, but I quoted specific part I was replying to. TS has no impact on runtime performance of JS. Type hints in Python have no impact on runtime performance of Python (unless you try things like mypyc etc; actually, mypy provides `from mypy_extensions import i64`)

          Therefore Python has no use for TS-like superset, because it already has facilities for static analysis with no bearing on runtime, which is what TS provides.

          • wiseowise 1 minute ago
            What OP means is that they need to:

            1) Add TS like language on top of Python in backwards compatible way

            2) Introduce frozen/final runtime types

            3) Use 1 and 2 to drive runtime optimizations

    • BiteCode_dev 10 minutes ago
      There will be not Python 4, and 3.X policy requires forward compat, so we are already there.
    • bloppe 5 hours ago
      But that's just not what python is for. Move your performance-critical logic into a native module.
      • mattclarkdotnet 5 hours ago
        Performance is one part of the discussion, but cleanliness is another. A Python4 that actually used typing in the interpreter, had value types, had a comptime phase to allow most metaprogramming to work (like monkey patching for tests) would be great! It would be faster, cleaner, easier to reason about, and still retain the great syntax and flexibility of the language.
        • mechsy 3 hours ago
          I too see potential in this - it started feeling a bit weird in recent years switching between Go, Python and Rust codebases with Python code looking more and more like a traditional statically typed language and not getting the performance benefits. I know I know, there are libraries and frameworks which make heavy use of fun stuff you can do with strings (leading to the breakdown of even the latest and greatest IDE tooling and red squiggly lines all over you code) and don’t get me started on async etc.

          Funnily enough I’ve found Python to be excellent for modelling my problem domain with Pydantic (so far basically unparalleled, open for suggestions in Go/Rust), while the language also gets out of my way when I get creative with list expressions and the like. So overall, still it is extremely productive for the work I’m doing, I just need to spin up more containers in prod.

    • panzi 6 hours ago
      Isn't rpython doing that, allowing changes on startup and then it's basically statically typed? Does it still exist? Was it ever production ready? I only once read a paper about it decades ago.
      • mattclarkdotnet 6 hours ago
        RPython is great, but it changes semantics in all sorts of ways. No sets for example. WTF? The native Set type is one of the best features of Python. Tuples also get mangled in RPython.
    • rich_sasha 5 hours ago
      I think sadly a lot of Python in the wild relies heavily, somewhere, on the crazy unoptimisable stuff. For example pytest monkey patches everything everywhere all the time.

      You could make this clean break and call it Python 4 but frankly I fear it won't be Python anymore.

      • fyrn_ 3 hours ago
        As a person who has spent a lot of time with pytest, I'm ready for testing framework that doesn't do any of that non-obvious stuff. Generally use unittest as much as I can these days, so much less _wierd_ about how it does things. Like jeeze pytest, do you _really_ need to stress test every obscure language feature? Your job is to call tests.
      • mattclarkdotnet 5 hours ago
        Allowing metaprogramming at module import (or another defined phase) would cover most monkey patching use cases. From __future__ import python4 would allow developers to declare their code optimisable.
    • mattclarkdotnet 6 hours ago
      Oh, and while we're at it, fix the "empty array is instantiated at parse time so all your functions with a default empty array argument share the same object" bullshit.
      • zahlman 3 hours ago
        We don't call them "arrays".

        It has nothing to do with whether the list is empty. It has nothing to do with lists at all. It's the behaviour of default arguments.

        It happens at the time that the function object is created, which is during runtime.

        You only notice because lists are mutable. You should already prefer not to mutate parameters, and it especially doesn't make sense to mutate a parameter that has a default value because the point of mutating parameters is that the change can be seen by the caller, but a caller that uses a default value can't see the default value.

        The behaviour can be used intentionally. (I would argue that it's overused intentionally; people use it to "bind" loop variables to lambdas when they should be using `functools.partial`.)

        If you're getting got by this, you're fundamentally expecting Python to work in a way that Pythonistas consider not to make sense.

        • Revisional_Sin 1 hour ago
          It's best practice to avoid mutable defaults even if you're not planning to mutate the argument.

          It's just slightly annoying having to work around this by defaulting to None.

      • Izkata 5 hours ago
        Execution time, not parse time. It's a side effect of function declarations being statements that are executed, not the list/dict itself. It would happen with any object.
        • mattclarkdotnet 5 hours ago
          It's still ridiculous. A hypothetical Python4 would treat function declarations as declarations not executable statements, with no impact on real world code except to remove all the boilerplate checks.
          • zahlman 3 hours ago
            There is no such thing as a "function declaration" in Python. The keyword is "def", which is the first three letters of the word "define" (and not a prefix of "declare"), for a reason.

            The entire point of it being an executable statement is to let you change things on the fly. This is key to how the REPL works. If I have `def foo(): ...` twice, the second one overwrites the first. There's no need to do any checks ahead of time, and it works the same way in the REPL as in a source file, without any special logic, for the exact same reason that `foo = 1` works when done twice. It's actually very elegant.

            People who don't like these decisions have plenty of other options for languages they can use. Only Python is Python. Python should not become not-Python in order to satisfy people who don't like Python and don't understand what Python is trying to be.

          • boxed 4 hours ago
            You think so but then you write a function with a default argument pointing to some variable that is a list and now suddenly the semantics of that are... what?
            • codesnik 3 hours ago
              you could just treat argument initialization as an executable expression which is called every time you call a function. If you have a=[], then it's a new [] every time. If a=MYLIST then it's a reference to the same MYLIST. Simple. And most sane languages do it this way, I really don't know why python has (and maintain) this quirk.
        • mattclarkdotnet 5 hours ago
          Let's not get started on the cached shared object refs for small integers....
          • zahlman 3 hours ago
            What realistic use case do you have for caring about whether two integers of the same value are distinct objects? Modern versions of Python warn about doing unpredicatble things with `is` exactly because you are not supposed to do those things. Valid use cases for `is` at all are rare.
            • thaumasiotes 1 hour ago
              > Valid use cases for `is` at all are rare.

              There might not be that many of them, depending on how you count, but they're not rare in the slightest. For example, you have to use `is` in the common case where you want the default value of a function argument to be an empty list.

      • zeratax 43 minutes ago
        there is PEP 671 for that, which introduces extra syntax for the behavior you want. people rely on the current behavior so you can't really change it
    • musicale 4 hours ago
      > Python really needs to take the Typescript approach of "all valid Python4 is valid Python3

      Great idea, but I'm not convinced that they learned anything from the Python 2 to 3 transition, so I wouldn't hold my breath.

      If you want a language system without contempt for backward compatibility, you're probably better off with Java/C++/JavaScript/etc. (though using JS libraries is like building on quicksand.) Bit of a shame since I want to like Python/Rust/Swift/other modern-ish languages, but it turns out that formal language specifications were actually a pretty good idea. API stability is another.

      • musicale 2 hours ago
        is that you, python core dev team? ;-)
  • adrian17 13 hours ago
    I'm been occasionally glancing at PR/issue tracker to keep up to date with things happening with the JIT, but I've never seen where the high level discussions were happening; the issues and PRs always jumped right to the gritty details. Is there anywhere a high-level introduction/example of how trace projection vs recording work and differ? Googling for the terms often returns CPython issue tracker as the first result, and repo's jit.md is relatively barebones and rarely updated :(

    Similarly, I don't entirely understand refcount elimination; I've seen the codegen difference, but since the codegen happens at build time, does this mean each opcode is possibly split into two (or more?) stencils, with and without removed increfs/decrefs? With so many opcodes and their specialized variants, how many stencils are there now?

    • flakes 13 hours ago
      You’ll probably want to look to the PEPs. Havent dug into this topic myself but looks related https://peps.python.org/pep-0744/
      • adrian17 12 hours ago
        I think CPython already had tier2 and some tracing infrastructure when the copy-and-patch JIT backend was added; it's the "JIT frontend" that's more obscure to me.
    • rtpg 4 hours ago
      discussions might be happening on the Python forums, which are pretty active.

      https://discuss.python.org/t/pep-744-jit-compilation/50756/8... here's one thing

      I do think you can also just outright ask questions about it on the forums and you'll get some answers.

      At the end of the day there's only so many people working on this though.

    • saikia81 12 hours ago
      have you read the dev mailing list? There the developers of python discuss lots.
      • pansa2 11 hours ago
        There isn’t a dev mailing list any more, is there? Do you mean the Discord forum?
    • sheepscreek 12 hours ago
      UPDATE: I misunderstood the question :-/ You can ignore this.

      I love playing with compilers for fun, so maybe I can shed some light. I’ll explain it in a simplified way for everyone’s benefit (going to ignore the stack):

      When an object is passed between functions in Python, it doesn’t get copied. Instead, a reference to the object’s memory address is sent. This reference acts as a pointer to the object’s data. Think of it like a sticky note with the object’s memory address written on it. Now, imagine throwing away one sticky note every time a function that used a reference returns.

      When an object has zero references, it can be freed from memory and reused. Ensuring the number of references, or the “reference count” is always accurate is therefore a big deal. It is often the source of memory leaks, but I wouldn’t attribute it to a speed up (only if it replaces GC, then yes).

      • yuliyp 12 hours ago
        what at all does this comment have to do with what it's replying to?
        • sheepscreek 10 hours ago
          I misread the original comment, thinking it was a question about what is refcount elimination, than how it affects the JIT's performance(?).
  • owaislone 11 hours ago
    Oh man, Python 2 > 3 was such a massive shift. Took almost half a decade if not more and yet it mainly changing superficial syntax stuff. They should have allowed ABIs to break and get these internal things done. Probably came up with a new, tighter API for integrating with other lower level languages so going forward Python internals can be changed more freely without breaking everything.
    • scorpioxy 11 hours ago
      The text encoding stuff wasn't a small change considering what it could break, at least. And remember we're sometimes talking about software that would cost a lot of money to migrate or upgrade. I still maintain some 2.x python code-bases that will be very expensive to migrate and the customer is not willing to invest that money.

      Although your general sentiment is something I agree with(if it's going to be painful do it and get it over with), I don't believe anybody knew or could've guessed what the reaction of the ecosystem would be.

      Your last point about being able to change internals more freely is also great in theory but very difficult(if not impossible) to achieve in practice.

      I don't know. Having maintained some small projects that were free and open source, I saw the hostility and entitlement that can come from that position. And those projects were a spec of dust next to something like Python. So I think the core team is doing the best they can. It was always going to be damned if you do, damned if you don't.

      • eru 8 hours ago
        > I still maintain some 2.x python code-bases that will be very expensive to migrate and the customer is not willing to invest that money.

        Slight tangent: if Claude can decimate IBM stock price by migrating off Cobol for cheap, surely we can do Python 2 to 3 now, too?

        About the internals: we sort of missed an opportunity there, but back then there also didn't quite know what they were doing (or at least we have better ideas of what's useful today). And making the step from 2 to 3 even bigger might have been a bad idea?

        • scorpioxy 7 hours ago
          I wasn't aware that migrating projects off Cobol has become cheap and it would only take a Claude subscription.

          In my experience, the problem had always been maintaining the business logic and any integrations with third-party software that also may be running legacy code-bases or have been abandoned. It can get quite complicated, from what I've seen. Now of course if you're talking about well maintained code-bases with 100%, or close to 100% test coverage, and that includes the integration part along with having the ability to maintain the user experience and/or user interface then yes it becomes a relatively easy process of "just write the code". But, in my experience, this has never been the case.

          For the 2.x code-bases I maintain, the customers simply doesn't want to pay for any of it. They might choose to at a later time, but so far it has been more cost effective for them to pay me to maintain that legacy code than pay to have it migrated. Other customers have different needs and thus budget differently.

          I'll refrain from judging if 2 to 3 was a missed opportunity or not. I believe the core team does actually know what they're doing and that any decision would've been criticized.

          • Tempest1981 6 hours ago
            IBM shares fell 13% in a single day in last month:

            "IBM Sinks Most Since 2000 as Anthropic Touts Cobol Tool"

            https://finance.yahoo.com/news/ibm-sinks-most-since-2000-210...

            It may not be "cheap", but possibly cheaper than IBM's consulting.

            • scorpioxy 6 hours ago
              I skip news like that. It's an AI business hyping one of their tools in a major AI hype-cycle. Shares can go up and down based on sentiment. My point still stands.

              To me, there's a big difference between saying that migration projects can now be assisted with some AI tooling and saying that it is cheap and to just get Claude to do it.

              Maybe I am out of touch but the former is realistic and the latter is just magical hand-waving.

            • Marazan 2 hours ago
              IBM share price is back to where it was pre-Anthropic press release.
              • thaumasiotes 1 hour ago
                Sure, but imagine how much higher it would have gone in the counterfactual world where Anthropic didn't have an automatic port-from-Cobol tool.
                • Maxion 59 minutes ago
                  Remember that those who trade on the stock market are not programmers with decades of experience writing cobol.
          • eru 5 hours ago
            > I believe the core team does actually know what they're doing and that any decision would've been criticized.

            I agree with the latter. About the former: they probably made a good decisions given the information available at the time. I mean that nowadays they know more than they did in the past.

        • CJefferson 4 hours ago
          Absoultely, I had a 2 -> 3 code base I'd mostly given up on, and Claude was amazing. It even re-wrote some libraries I used without py3 versions, decided to just write the parts of the libraries I needed.

          It does much better with good tests. In my case the output was a statically generated website, so I could just say 'make the same website, given these inputs'.

    • smcl 8 hours ago
      I cannot believe people are still acting like Python 2->3 was a huge fuck-up and an enormous missed opportunity. When in reality Python is by most measures the most popular language and became so AFTER that switch.

      Since the switch we have seen enormous companies being built from scratch. There is no reason for anyone to be complaining about it being too hard to upgrade in 2026

      • rtpg 4 hours ago
        Living through it... Python 3 made a lot of changes for the better but 3.0 in particular included a bunch of unforced errors that made it too hard for people to upgrade in one go.

        It wasn't until much later (I would say 3.4 or 3.5?) that we had good tooling to allow for migrating from Python 2 to Python 3 gradually, which is what most tools needed to do.

        The final thing that made Python upgrading easy was making a bunch of changes (along with stuff like six) so that you could write code that would run identically in Python 2 and Python 3. That lets you do refactors over time, little cleanups, and not have the huge "move to Python 3" commit.

      • badsectoracula 6 hours ago
        > Python is by most measures the most popular language and became so AFTER that switch

        The switch had nothing to do with Python's rise in popularity though, it was because of NumPy and later PyTorch being adopted by data scientist and later machine learning tasks that themselves became very popular. Python's popularity rose alongside those.

        > There is no reason for anyone to be complaining about it being too hard to upgrade in 2026

        The "complaints" are about unnecessary and pointless breakage, that was very difficult for many codebases to upgrade for years. That by now most of these codebases have been either abandoned, upgraded or decided to stick with Python2 until the end of time doesn't mean these pains didn't happen nor that the language's developers inflicting them to their users were a good idea because some largely unrelated external factors made the language popular several years later.

        • Izkata 5 hours ago
          > that was very difficult for many codebases to upgrade for years.

          In case people have forgotten: python 3.3 through 3.5 (and 3.6 I think) each had to reintroduce something that was removed to make the upgrade easier. Jumping from 2.7 to 3.3 (or higher depending on what you needed) was the recommended route because of this, it was less work than going to 3.0, 3.1, or 3.2

      • 20k 7 hours ago
        It took a long time for python 3 to add the necessary backwards compatibility features to allow people to switch over. Once they did it was fine, but it was a massive fuck up until then. The migration took far longer than it should have done

        Its widely regarded as a disaster for good reason, that forced some corrections in python to fix it. Just because its fine now, does not mean it was always fine

      • bmitc 4 hours ago
        Those are unrelated.
    • nurettin 5 hours ago
      The biggest (and worst planned) change was module names. Your imports didn't work, forcing hacks like

          if sys.version_info.major == 2:
              import old
          else:
              import new
      
      Or worse, people used try/except in their imports.
    • jmspring 7 hours ago
      still GIL
    • gjvc 11 hours ago
      yes. it was not a massive shift. it was barely worth the effort.
      • pansa2 11 hours ago
        The Python devs didn’t want to make huge changes because they were worried Python 3 would end up taking forever like Perl 6. Instead they went to the other extreme and broke everyone’s code for trivial reasons and minimal benefit, which meant no-one wanted to upgrade.

        Even the main driver for Python 3, the bytes-Unicode split, has unfortunately turned out to be sub-optimal. Python essentially bet on UTF-32 (with space-saving optimisations), while everyone else has chosen UTF-8.

        • diziet_sma 9 hours ago
          > Python essentially bet on UTF-32 (with space-saving optimisations)

          How so? Python3 strings are unicode and all the encoding/decoding functions default to utf-8. In practice this means all the python I write is utf-8 compatible unicode and I don't ever have to think about it.

          • sheept 9 hours ago
            UTF-32 allows for constant time character accesses, which means that mystr[i] isn't O(n). Most other languages can only provide constant time access for code units.
            • msl 2 hours ago
              UTF-32 allows for constant time access to code points. Neither UTF-8 nor UTF-16 can do the same (there are 2 to the power of 20 valid code points, though not all are in use).

              While most characters might be encodable as a single code point, Python does not normalize strings, so there is no guarantee that even relatively normal characters are actually stored as single code points.

              Try this in Python:

                s = "a\u0308"
                print(s)
                print(s[0])
              
              You will see:

                ä
                a
          • pansa2 9 hours ago
            > all the encoding/decoding functions default to utf-8

            Languages that use UTF-8 natively don't need those functions at all. And the ones in Python aren't trivial - see, for example, `surrogateescape`.

            As the sibling comment says, the only benefit of all this encoding/decoding is that it allows strings to support constant-time indexing of code points, which isn't something that's commonly needed.

            • laurencerowe 8 hours ago
              They absolutely do because random byte strings are not valid utf8. Safe Rust requires validating bytes when converting to strings because this.
          • cloudbonsai 6 hours ago
            Internally Python holds a string as an array of uint32. A utf-8 representation is created on demand from it (and cached). So pansa2 is basically correct [^1].

            IMO, while this may not be optimal, it's far better than the more arcane choice made by other systems. For example, due to reasons only Microsoft can understand, Windows is stuck with UTF-16.

            [1] Actually it's more intelligent. For example, Python automatically uses uint8 instead of uint32 for ASCII strings.

            • zahlman 3 hours ago
              There is no caching of a "utf-8 representation". You may check for example:

                >>> x = '日本語'*100000000
                >>> import time
                >>> t = time.time(); y = x.encode(); time.time() - t # takes nontrivial time
                >>> t = time.time(); y = x.encode(); time.time() - t # not cached; not any faster
              
              Generally, the only reason this would happen implicitly is for I/O; actual operations on the string operate directly on the internal representation.

              Python uses either 8, 16 or 32 bits per character according to the maximum code point found in the string; uint8 is thus used for all strings representable in Latin-1, not just "ASCII". (It does have other optimizations for ASCII strings.)

              The reason for Windows being stuck with UTF-16 is quite easy to understand: backwards compatibility. Those APIs were introduced before there supplementary Unicode planes, such that "UTF-16" could be equated with UCS-2; then the surrogate-pair logic was bolted on top of that. Basically the same thing that happened in Java.

              • cloudbonsai 26 minutes ago
                > There is no caching of a "utf-8 representation".

                No there certainly is. This is documented in the official API documentation:

                    UTF-8 representation is created on demand and cached in the Unicode object.
                
                    https://docs.python.org/3/c-api/unicode.html#unicode-objects
                
                In particular, Python's Unicode object (PyUnicodeObject) contains a field named utf8. This field is populated when PyUnicode_AsUTF8AndSize() is first called and reused thereafter. You can check the exact code I'm talking about here:

                https://github.com/python/cpython/blob/main/Objects/unicodeo...

                Is it clear enough?

            • nslsm 5 hours ago
        • zahlman 3 hours ago
          > Python essentially bet on UTF-32 (with space-saving optimisations), while everyone else has chosen UTF-8.

          It did nothing of the sort. UTF-8 is the default source file encoding and has been the target for many APIs. It likely would have been the default for all I/O stuff if we lived in a world where Windows had functioning Unicode in the terminal the whole time and didn't base all its internal APIs on UTF-16.

          I assume you're referring to the internal representation of strings. Describing it as "UTF-32 with space-saving optimizations" is missing the point, and also a contradiction in terms. Yes, it is a system that uses the same number of bytes per character within a given string (and chooses that width according to the string contents). This makes random access possible. Doing anything else would have broken historical expectations about string slicing. There are good arguments that one shouldn't write code like that anyway, but it's hard to identify anything "sub-optimal" about the result except that strings like "I'm learning 日本語" use more memory than they might be able to get away with. (But there are other strings, like "ℍℯℓ℗", that can use a 2-byte width while the UTF-8 encoding would add 3 bytes per character.)

        • rjh29 10 hours ago
          Ironically Perl 5 managed to do the bytes-Unicode split with a feature gate, no giant major version change.
      • gjvc 5 hours ago
        this must be right, i'm getting downvoted
        • zahlman 3 hours ago
          Please don't do this.
        • boxed 4 hours ago
          It's wrong. Python3 eliminated mountains of annoying bugs that happened all over the code base because of mixing of unicode strings and byte strings. Python2 was an absolute mess.
  • rslashuser 10 hours ago
    I'm curious is the JIT developers could mention any Python features that prevent promising JIT features. An earlier Ken Jin blog [1], mentions how __del__ complicates reference counting optimization.

    There is a story that Python is harder to optimize than, say, Typescript, with Python flexibility and the C API getting mentioned. Maybe, if the list of troublesome Python features was out there, programmers could know to avoid those features with the promise of activating the JIT when it can prove the feature is not in use. This could provide a way out of the current Python hard-to-JIT trap. It's just a gist of an idea, but certainly an interesting first step would be to hear from the JIT people which Python features they find troublesome.

    [1] https://fidget-spinner.github.io/posts/faster-jit-plan.html

    • rtpg 10 hours ago
      It's interesting you mention __del__ because Javascript not only doesn't have destructors but for security reasons (that are above my pay grade) but the spec _explicitly prohibits_ implementations from allowing visibility into garbage collection state, meaning that code cannot have any visibility into deallocations.

      I think __del__ is tricky though. In theory __del__ is not meant to be reliable. In practice CPython reliably calls it cuz it reference counts. So people know about it and use it (though I've only really seen it used for best effort cleanup checks)

      In a world where more people were using PyPy we could have pressure from that perspective to avoid leaning into it. And that would also generate more pressure to implement code that is performant in "any" system.

      • cpgxiii 9 hours ago
        > In practice CPython reliably calls it cuz it reference counts ... In a world where more people were using PyPy we could have pressure from that perspective to avoid leaning into it

        A big part of the problem is that much of the power of the Python ecosystem comes specifically from extensions/bindings written in languages with manual (C) or RAII/ref-counted (C++, Rust) memory management, and having predictable Python-level cleanup behavior can be pretty necessary to making cleanup behavior in bound C/C++/Rust objects work. Breaking this behavior or causing too much of a performance hit is basically a non-starter for a lot of Python users, even if doing so would improve the performance of "pure" Python programs.

        • mattip 4 hours ago
          That cleanup can be explicit when needed by using context managers. Mixing resource handling with object lifetime is a bad design choice
      • nvme0n1p1 10 hours ago
        > code cannot have any visibility into deallocations

        Doesn't FinalizationRegistry let you do exactly that?

        https://developer.mozilla.org/en-US/docs/Web/JavaScript/Refe...

        • alpinisme 9 hours ago
          That link itself calls out that conformant implementations can’t be relied on to call callbacks.

          > A conforming JavaScript implementation, even one that does garbage collection, is not required to call cleanup callbacks. When and whether it does so is entirely down to the implementation of the JavaScript engine. When a registered object is reclaimed, any cleanup callbacks for it may be called then, or some time later, or not at all. It's likely that major implementations will call cleanup callbacks at some point during execution, but those calls may be substantially after the related object was reclaimed. Furthermore, if there is an object registered in two registries, there is no guarantee that the two callbacks are called next to each other — one may be called and the other never called, or the other may be called much later. There are also situations where even implementations that normally call cleanup callbacks are unlikely to call them:

          • bastawhiz 9 hours ago
            It's supported in all of the major engines. And you also can't rely on the garbage collector to run at a predictable time (or at all!), so the engine never calling finalizers is functionally the same as the garbage collector being unusual.
            • sfink 4 hours ago
              The only (other) visible effect of GC not running is memory exhaustion. WeakRef/FinalizationGroup not getting triggered can have lots of script-visible effects, so can be much much worse. I wouldn't describe that as "functionally the same".
        • rtpg 9 hours ago
          Oh! While this one does mention that you don't have visibility, this + weak refs seem to change the game

          I remember a couple of years ago (well probably around 2021) reading about GC exposure concerns and seeing some line in some TC39 doc like "users should not have visibility into collection" but if we've shipped weakrefs sounds like we're not thinking about that anymore

          • sfink 4 hours ago
            We still try to limit any additional exposure as much as possible, and WR/FG are specced to keep the visibility as coarse as possible. (Collections won't be visible until the current script execution finishes, though async adds a lot more places where that can happen.)

            A proposal to add new ways of observing garbage collection will still be shot down immediately without a damn good justification.

      • jonathanlydall 5 hours ago
        > meaning that code cannot have any visibility into deallocations.

        This is more pedantry than a serious question. JavaScript has WeakReference, sure it'd be cumbersome and inefficient because you'd need to manually make and poll each thing you wanted to observe, but could it not be said that it does provide a view on deallocations?

        • sfink 4 hours ago
          Yes, WeakRef and FinalizationGroup both make GC visible (the latter removes the need to poll in your example). So not pedantic at all. They were eventually added after much reluctance from the language designers and implementers, partly because they can lead to code being broken by (valid & correct) engine optimizations, which is a big no-no on the web. But some things simply cannot be implemented without them.

          Note that 90% of the uses for them actually shouldn't be using them, usually for subtle reasons. It's always a big cause for debate.

    • kstrauser 8 hours ago
      Huh, I could imagine that as a set of Ruff rules:

      > Using str.frobnicate prevents TurboJit on line 63

    • adgjlsfhk1 10 hours ago
      The biggest thing is BigInt by default. It makes every integer operation require an overflow check.
      • ridiculous_fish 8 hours ago
        JS (when using ints, which v8 does) is the same in this respect.
  • vanderZwan 11 hours ago
    > However, I misunderstood and came up with an even more extreme version: instead of tracing versions of normal instructions, I had only one instruction responsible for tracing, and all instructions in the second table point to that. Yes I know this part is confusing, I’ll hopefully try to explain better one day. This turned out to be a really really good choice. I found that the initial dual table approach was so much slower due to a doubling of the size of the interpreter, causing huge compiled code bloat, and naturally a slowdown.

    > By using only a single instruction and two tables, we only increase the interpreter by a size of 1 instruction, and also keep the base interpreter ultra fast. I affectionally call this mechanism dual dispatch.

    I really do hope they'll write that better explanation one day because this sounds pretty intriguing all on its own.

  • pjmlp 2 hours ago
    Great to see this going, Python also deserves a JIT, and given that only few bother with PyPy or GraalPy, shipping into the CPYthon is the only way to have less "rewrite into XYZ".

    Kudos to those involved into making it happen.

  • oystersareyum 13 hours ago
    > We don’t have proper free-threading support yet, but we’re aiming for that in 3.15/3.16. The JIT is now back on track.

    I recently read an interview about implementing free-threading and getting modifications through the ecosystem to really enable it: https://alexalejandre.com/programming/interview-with-ngoldba...

    The guy said he hopes the free-threaded build'll be the only one in "3.16 or 3.17", I wonder if that should apply to the JIT too or how the JIT and interpreter interact.

    • zarzavat 11 hours ago
      I continue to believe that free-threading hurts performance more than it helps and Python should abandon it.

      Having to have thread safe code all over the place just for the 1% of users who need to have multi-threading in Python and can't use subinterpreters for some reason is nuts.

      • cpgxiii 8 hours ago
        > Having to have thread safe code all over the place just for the 1% of users who need to have multi-threading in Python and can't use subinterpreters for some reason is nuts.

        Way more than 1% of the community, particularly of the community actively developing Python, wants free-threaded. The problem here is that the Python community consists of several different groups:

        1. Basically pure Python code with no threading

        2. Basically pure Python with appropriate thread safety

        3. Basically pure Python code with already broken threaded code, just getting lucky for now

        4. Mixed Python and C/C++/Rust code, with appropriate threading behavior in the C or C++ components

        5. Mixed Python and C or C++ code, with C and C++ components depending on GIL behavior

        Group 1 gets a slightly reduced performance. Groups 2 and 4 get a major win with free-threaded Python, being able to use threading through their interfaces to C/C++/Rust components. Group 3 is already writing buggy code and will probably see worse consequences from their existing bugs. Group 5 will have to either avoid threading in their Python code or rewrite their C/C++ components.

        Right now, a big portion of the Python language developer base consists of Groups 2 and 4. Group 5 is basically perceived as holding Python-the-language and Python-the-implementations back.

        • zarzavat 2 hours ago
          Where is the major win? Sorry but I just don't see the use case for free-threading.

          Native code can already be multi-threaded so if you are using Python to drive parallelized native code, there's no win there. If your Python code is the bottleneck, well then you could have subinterpreters with shared buffers and locks. If you really need to have shared objects, do you actually need to mutate them from multiple interpreters? If not, what about exploring language support for frozen objects or proxies?

          The only thing that free threading gives you is concurrent mutations to Python objects, which is like, whatever. In all my years of writing Python I have never once found myself thinking "I wish I could mutate the same object from two different threads".

      • pansa2 11 hours ago
        Maybe they could have two versions of the interpreter, one that’s thread-safe and one that’s optimised for single-threading?

        Microsoft used to do this for their C runtime library.

        • chuckadams 10 hours ago
          PHP does this as well. Most distributions ship PHP without thread safety, but it's seeing more use now that FrankenPHP uses it. Speaking of which, it would be nice if PHP's JIT got a little love: it's never eked out more than marginal gains in heavily-numeric code.
        • veber-alex 11 hours ago
          That's exactly what we have now and it looks like the python devs want a single unified build at some point
      • zadikian 5 hours ago
        Pure Python code always needed mutexes for thread safety with or without ol' GIL. I thought the difficulty with removing the GIL instead had to do with C extensions that rely on it.
      • kzrdude 11 hours ago
        I don't want to go too heavy on the negatives, but what's nuts is Python going for trust-the-programmer style multithreading. The risk is that extension modules could cause a lot of crashes.
        • gwking 8 hours ago
          My understanding is that many extension modules are already written to take advantage of multithreading by releasing the GIL when calling into C code. This allows true concurrency in the extension, and also invites all the hazards of multithreading. I wonder how many bugs will be uncovered in such extensions by the free threaded builds, but it seems like the “nuts” choice actually happened a long time ago.
  • ekjhgkejhgk 13 hours ago
    Doesn't PyPy already have a jit compiler? Why aren't we using that?
    • olivia-banks 13 hours ago
      As far as I know, PyPy doesn't support all CPython extensions, so pure Python code will probably (very likely) run fine but for other things most bets are off. I believe PyPy also only supports up to 3.11?
    • hrmtst93837 12 hours ago
      PyPy isn't CPython.

      A lot of Python code still leans on CPython internals, C extensions, debuggers, or odd platform behavior, so PyPy works until some dependency or tool turns that gap into a support problem.

      The JIT helps on hot loops, but for mixed workloads the warmup cost and compatibility tax are enough to keep most teams on the interpreter their deps target first.

    • contravariant 13 hours ago
      Why shouldn't the reference implementation get JIT? Just because some other implementations already have it is no reason not to. That'd be like skipping list comprehensions because they already exist in CPython.
    • cpburns2009 13 hours ago
      PyPy is limited to maintenance mode due to a lack of funding/contributors. In the past, I think a few contributors or funding is what helped push "minor" PyPy versions. It's too bad PyPy couldn't take the federal funding the PSF threw away.
    • 3laspa 13 hours ago
      Because the same people who made a big deal about supporting PyPy and PEP 399 when it was fashionable to do so are now told by their corporations that PyPy does not matter. CPython only moves with what is currently fashionable, employer mandated and profitable.
    • JoshTriplett 13 hours ago
      Because PyPy seems to be defunct. It hasn't updated for quite a while.

      See https://github.com/numpy/numpy/issues/30416 for example. It's not being updated for compatibility with new versions of Python.

      • mkl 13 hours ago
      • LtWorf 13 hours ago
        [flagged]
        • Waterluvian 13 hours ago
          It supports at best Python 3.11 code, right?

          So it’s not unmaintained, no. But the project is currently under resourced to keep up with the latest Python spec.

          • LtWorf 12 hours ago
            That is not the same thing at all, and not what he said.
            • JoshTriplett 12 hours ago
              It is exactly what I'm referring to. I didn't say there aren't still people around. But they're far enough behind CPython that folks like NumPy are dropping support. Unless they get a substantial injection of new people and new energy, they're likely to continue falling behind.
              • bigstrat2003 3 hours ago
                > I didn't say there aren't still people around.

                You said it was defunct, which would mean there aren't still people working on it.

              • LtWorf 3 hours ago
                Not what you wrote.

                Also CPython 3.10 is not EOL so library authors won't be using anything from 3.11 anyway.

                • JoshTriplett 2 hours ago
                  [flagged]
                  • tomhow 2 hours ago
                    You've both been here long enough to know that this kind of sniping should be avoided here.
                  • LtWorf 2 hours ago
                    [flagged]
                    • tomhow 2 hours ago
                      You've both been here long enough to know that this kind of sniping should be avoided here.
  • ghm2199 10 hours ago
    Thanks for all the amazing work! I have Noob question. Wouldn't this get the funding back? Or would that not be preferable way to continue(as opposed to just volunteer driven)?

    Like this is a big deal to get a project to a state where volunteers are spun up and actively breaking tasks and getting work done, no? It's a python JIT something I know next to nothing about — as do most application developers — which tells one how difficult this must have been.

    • pansa2 10 hours ago
      > Wouldn't this get the funding back?

      The funding was Microsoft employing most of the team. They were laid off (or at least, moved onto different projects), apparently because they weren't working on AI.

      • kelvinjps 8 hours ago
        With Python being the main language for AI, isn't like more important to be more performant? I kinda don't get Microsoft reasoning, maybe they're just tight in money
        • brianwawok 8 hours ago
          I don’t think Python is the main language of AI.
          • eru 7 hours ago
            Python is pretty big as glue in the AI ecosystem as far as I can tell. It also seems to be most agent's 'preferred' language to write code in, when you don't specify anything.

            (The latter is probably more to do with the preferences they give it in the re-inforcement learning phase than anything technical, though.)

    • Ralfp 10 hours ago
      It looks like ARM picked up plenty of those folk and pays them to continue this work.
  • thunky 10 hours ago
    I always wanted this for Python but now that machines write code instead of humans I feel like languages like Python will not be needed as much anymore. They're made for humans, not machines. If a machine is going to do the dirty work I want it to produce something lean, fast, and strictly verified.
    • bigstrat2003 3 hours ago
      > now that machines write code instead of humans

      That is not remotely the case for anyone who produces quality work.

    • zahlman 2 hours ago
      We got daguerrotypes, and then photographic film, and then digital cameras, along with image editing software, and now AI image generation systems; yet there are still people who go out and apply oil paints to a canvas with natural hair brushes. I'm not willing to lose that.
    • JodieBenitez 10 hours ago
      Pretty much my thoughts the other day... now that Codex does the writing, maybe I can finally switch to Go for the web backend stuff without being annoyed by some of its archaisms and gain significant execution performance, while still having a relatively easy to read language.
      • kccqzy 10 hours ago
        You ask a machine to write your code and you still care about being easy to read?

        In my experience the people who care the most about code readability tend to be the people most opinionated on having the right abstractions, which are historically not available in Go.

        • thunky 10 hours ago
          I don't think people mind reading Go as much as they mind writing it.
          • kccqzy 9 hours ago
            Nah all the `if err != nil` is just so much noise they obscures the real logic. And for the longest time it didn’t have generics to write map/filter/reduce on slices, forcing people to use loops where the intention is less clear.
            • maleldil 6 hours ago
              Ideally, the errors shouldn't be returned as-is, but wrapped with context instead. If that context doesn't matter for you, you can have your editor wrap the if instead, which helps a lot.
      • brianwawok 8 hours ago
        I have shifted as much as I can python to go when I don’t code. It’s just faster and the compiler catches more errors, win win,
    • ddorian43 2 hours ago
      AI, write me that sqlalchemy clone in <lang>
  • ecshafer 13 hours ago
    What is wrong with the Python code base that makes this so much harder to implement than seemingly all other code bases? Ruby, PHP, JS. They all seemed to add JITs in significantly less time. A Python JIT has been asked for for like 2 decades at this point.
    • fleetfox 30 minutes ago
      I can't really talk about Ruby. But PHP is much more static and surface of things you have to care about at runtime is like magnitude smaller and there already was opache as a starting point. And speaking of something like JIT in V8 is of the most sophisticated and complicated ever built. There hasn't been near enough man hours and funding to cpython to make it fair comparison
    • 0cf8612b2e1e 12 hours ago
      The Python C api leaks its guts. Too much of the internal representation was made available for extensions and now basically any change would be guaranteed to break backwards compatibility with something.
      • patmorgan23 12 hours ago
        Ooo this makes sense it's like if the Linux had don't break users space AND a whole bunch of other purely internal APIs you also can't refactor.
      • echelon 12 hours ago
        It's a shame that Python 2->3 transition was so painful, because Python could use a few more clean breaks with the past.

        This would be a potential case for a new major version number.

        • froobius 12 hours ago
          On the other hand, taking backwards compatibility so seriously is a big part of the massive success of Python
          • __mharrison__ 11 hours ago
            I would argue that the libraries, and specifically NumPy, are the reason Python is still in the picture today.

            It will be interesting to see, moving forward, what languages survive. A 15% perf increase seems nice, until you realize that you get a 10x increase porting to Rust (and the AI does it for you).

            Maybe library use/popularity is somewhat related to backwards compatibility.

            Disclaimer: I teach Python for a living.

            • kelvinjps 8 hours ago
              Python it's a language that really good libraries for different domains. like web: django/flask AI numpy pytorch and more. All the ecosystem for scripting and being already installed in most linux distros and on macs. For GUI it has really good bindings for the major frameworks QT,GTK.
            • punnerud 11 hours ago
              And PyTorch, and Pandas, and, and…
            • B1FF_PSUVM 11 hours ago
              > you get a 10x increase porting to Rust (and the AI does it for you)

              So, you keep reading/writing Python and push a button to get binary executables through whatever hoops are best today ?

              (I haven't seen the "fits your brain" tagline in the recent past ...)

          • pansa2 12 hours ago
            >> Python 2->3 transition

            > taking backwards compatibility so seriously

            Python’s backward compatibility story still isn’t great compared to things like the Go 1.x compatibility promise, and languages with formal specs like JS and C.

            The Python devs still make breaking changes, they’ve just learned not to update the major version number when they do so.

            • BarryMilo 11 hours ago
              Indeed, Python's version format is semver but it's just aesthetics, they remove stuff in most (every?) minor version. Just yesterday I wasted hours trying to figure out a bug before realizing my colleague hadn't read the patch notes.
          • kccqzy 11 hours ago
            Python does not take backwards compatibility seriously. 2 to 3 is a big compatibility break. But things like `map(None, seq1, seq2)` also broke; such deliberate compatibility break is motivated by no more than aesthetic purity.
          • IshKebab 12 hours ago
            Python does not take backwards compatibility very seriously at all. Take a look at all the deprecated APIs.

            I would say it's probably worth it to clean up all the junk that Python has accumulated... But it's definitely not very high up the list of languages in terms of backwards compatibility. In fact I'm struggling to think of other languages that are worse. Typescript probably? Certainly Go, C++ and Rust are significantly better.

    • hardwaregeek 12 hours ago
      For what it’s worth Ruby’s JIT took several different implementations, definitely struggled with Rails compatibility and literally used some people’s PhD research. It wasn’t a trivial affair
    • stmw 13 hours ago
      Some languages are much harder to compile well to machine code. Some big factors (for any languages) are things like: lack of static types and high "type uncertainty", other dynamic language features, established inefficient extension interfaces that have to be maintained, unusual threading models...
      • RussianCow 12 hours ago
        That makes sense if you're comparing with Java or C#, but not Ruby, which is way more dynamic than Python.

        The more likely reason is that there simply hasn't been that big a push for it. Ruby was dog slow before the JIT and Rails was very popular, so there was a lot of demand and room for improvement. PHP was the primary language used by Facebook for a long time, and they had deep pockets. JS powers the web, so there's a huge incentive for companies like Google to make it faster. Python never really had that same level of investment, at least from a performance standpoint.

        To your point, though, the C API has made certain types of optimizations extremely difficult, as the PyPy team has figured out.

        • vlovich123 12 hours ago
          Google, Dropbox, and Microsoft from what I can recall all tried to make Python fast so I don’t buy the “hasn’t seen a huge amount of investment”. For a long time Guido was opposed to any changes and that ossified the ecosystem.

          But the main problem was actually that pypy was never adopted as “the JIT” mechanism. That would have made a huge difference a long time ago and made sure they evolved in lock step.

          • int_19h 11 hours ago
            Microsoft is the one the TFA refers to cryptically when it says "the Faster CPython team lost its main sponsor in 2025".

            AFAIK it was not driven by anything on the tech side. It was simply unlucky timing, the project getting in the middle of Microsoft's heavy handed push to cut everything. So much so that the people who were hired by MS to work on this found out they were laid off in a middle of a conference where they were giving talks on it.

        • flykespice 12 hours ago
          > Python never really had that same level of investment, at least from a performance standpoint.

          Or lack of incentive?

          Alot of big python projects that does machine learning and data processing offloads the heavy data processing from pure python code to libraries like numpy and pandas that take advantage of C api binding to do native execution.

      • simonask 12 hours ago
        The simplest JIT just generates the machine code instructions that the interpreter loop would execute anyway. It’s not an extremely difficult thing, but it also doesn’t give you much benefit.

        A worthwhile JIT is a fully optimizing compiler, and that is the hard part. Language semantics are much less important - dynamic languages aren’t particularly harder here, but the performance roof is obviously just much lower.

      • kelvinjps 8 hours ago
        I think that it's just that python people took the problem different, they made working with c and other languages better, and just made bindings for python and offloaded the performant code to these libraries. Ex: numpy
    • fridder 12 hours ago
      For better or for worse they have been very consistent throughout the years that they don't want want to degrade existing performance. It is why the GIL existed for so long
    • bawolff 12 hours ago
      I thought php hasn't shipped jit yet (as in its behind a disabled by default config)
    • brokencode 12 hours ago
      Are you forgetting about PyPy, which has existed for almost 2 decades at this point?
      • RussianCow 12 hours ago
        That's a completely separate codebase that purposefully breaks backwards compatibility in specific areas to achieve their goals. That's not the same as having a first-class JIT in CPython, the actual Python implementation that ~everyone uses.
        • brokencode 12 hours ago
          Definitely agree that it’s better to have JIT in the mainline Python, but it’s not like there weren’t options if you needed higher performance before.

          Including simply implementing the slow parts in C, such as the high performance machine learning ecosystem that exists in Python.

    • wat10000 13 hours ago
      PHP and JS had huge tech companies pouring resources into making them fast.
    • g947o 12 hours ago
      Money.
  • fluidcruft 13 hours ago
    (what are blueberry, ripley, jones and prometheus?)
    • mkl 13 hours ago
      Yes, the graphs are incomprehensible because those are not defined in the article. They turn out to be different physical machines with different architectures: https://doesjitgobrrr.com/about

        blueberry (aarch64)
        Description: Raspberry Pi 5, 8GB RAM, 256GB SSD
        OS: Debian GNU/Linux 12 (bookworm)
        Owner: Savannah Ostrowski
      
        ripley (x86_64)
        Description: Intel i5-8400 @ 2.80GHz, 8GB RAM, 500GB SSD
        OS: Ubuntu 24.04
        Owner: Savannah Ostrowski
      
        jones (aarch64)
        Description: Apple M3 Pro, 18GB RAM, 512GB SSD
        OS: macOS
        Owner: Savannah Ostrowski
      
        prometheus (x86_64)
        Description: AMD Ryzen 5 3600X @ 3.80GHz, 16GB RAM
        OS: Windows 11 Pro
        Owner: Savannah Ostrowski
    • max-m 13 hours ago
      The names of the benchmark runners. https://doesjitgobrrr.com/about
      • fluidcruft 13 hours ago
        So the biggest gains so far are on Windows 11 Pro of (x86_64) ~20%? Is that because Windows was bad as a baseline (promethius)? It doesn't seem like the x86_64/Linux has improved as dramatically ~5% (ripley). I'm just surprised OS has that much of an effect that can be attributed to JIT vs other OS issues.
        • raddan 12 hours ago
          It's hard to say whether it's Windows related since the two x86_64 machines don't just run different OSes, they also have different processors, from different manufacturers. I don't know whether an AMD Ryzen 5 3600X versus Intel i5-8400 have dramatically different features, but unlike a generic static binary for x86_64, a JIT could in principle exploit features specific to a given manufacturer.
    • nonameiguess 12 hours ago
      The immediate question has been answered, but what about the names? The latter three are obvious references to the Alien universe, but what relationship does blueberry have to them?
      • luhn 12 hours ago
        I assume Blueberry is a nod to the machine being a Raspberry Pi.
  • killingtime74 13 hours ago
    Sorry but the graphs are completely unreadable. There are four code names for each of the lines. Which is jit and which is cpython?
    • mkl 13 hours ago
      They are all JIT on different architectures, measured relative to CPython. https://doesjitgobrrr.com/about: blueberry is aarch64 Raspberry Pi, ripley is x86_64 Intel, jones is aarch64 M3 Pro, prometheus is x86_64 AMD.
  • aplomb1026 10 hours ago
    [dead]
  • devnotes77 10 hours ago
    [dead]
  • openclaw01 8 hours ago
    [dead]
  • AgentMarket 13 hours ago
    [flagged]
    • anon291 12 hours ago
      Reference counting is not a strict requirement for python. Certainly not accurate counting.
    • 1819231267 13 hours ago
      [flagged]
      • jqbd 12 hours ago
        Wait is this real? Does it mean this person read it or the bot read it, I don't think this is moltbook if the latter
        • ayhanfuat 12 hours ago
          AgentMarket is a bot spamming multiple threads with AI generated comments, if that is what you are asking.
      • AgentMarket 12 hours ago
        [flagged]
  • fivedicks 10 hours ago
    [flagged]
  • rafph 13 hours ago
    [flagged]
    • rsoto2 13 hours ago
      I am trying to push back. I don't care if other people think the tools make them faster, I did not sign up to be a guinea pig for my employer or their AI-corp partner.
  • wei03288 12 hours ago
    [flagged]