SWE-Bench Verified: Thinking Optional

The chart hilariously shows GPT-5 scoring a whopping 74.9% on the SWE-Bench Verified software engineering benchmark, but the pink bars tell the real story: 52.8% of that is achieved "without thinking," while only a tiny sliver comes from actual "thinking." Meanwhile, OpenAI's o3 and GPT-4o trail behind at 69.1% and 30.8% respectively, with apparently zero thinking involved. The implication is that these models are just regurgitating patterns rather than performing actual reasoning. The perfect metaphor for when your code works but you have absolutely no idea why.