๐Ÿก


to read (pdf)

  1. I don't want your PRs anymore
  2. JitterDropper | OALABS Research
  3. DomainTools Investigations | DPRK Malware Modularity: Diversity and Functional Specialization
  4. EXHIB: A Benchmark for Realistic and Diverse Evaluation of Function Similarity in the Wild
  5. Neobrutalism components - Start making neobrutalism layouts today

  1. May 28, 2026
    1. ๐Ÿ”— BarutSRB/OmniWM OmniWM v0.4.9.6 release

      What's New Since 0.4.9.5

      Window Lifecycle and Restore

      • Fixed browser fullscreen placeholder cleanup and fake-window gating during fullscreen restore.
      • Restored persisted windows after relaunch, including better monitor/workspace recovery.
      • Preserved tiled parent relationships for floating child windows.
      • Fixed resize placeholder fallback behavior when AX refuses size writes.
      • Fixed stale floating focus border cleanup and matched border corner radius more accurately.

      Input and Navigation

      • Added configurable Hyper and leader-key hotkey sequences.
      • Routed Overview focus hotkeys through modal selection.

      Layout and Interface Polish

      • Smoothed Dwindle animation retargeting.
      • Removed the Niri row cap setting.
      • Fixed workspace bar height clamping.
      • Fixed tabbed overlay window layering.

      Runtime State and Storage

      • Moved Quake frame state and command palette mode out of settings and into runtime state.
      • Fixed XDG storage paths for private state.

      Sponsors

      Release Integrity

      • OmniWM-v0.4.9.6.zip contains the Developer ID signed, notarized, and stapled OmniWM app.
      • OmniWM-v0.4.9.6.zip SHA-256: d17867d5e3eb321da549523d3d029c9f1180ae551cbff5e32fe2641759410bf2
      • GhosttyKit.xcframework-v0.4.9.6.zip SHA-256: 2118860c992fd13a1aee5d7f9a7d6322edc41d94b97a4a3f9ca7846ac0f3c637
    2. ๐Ÿ”— Console.dev newsletter Files SDK rss

      Description: Unified SDK for object and blob stores.

      What we like: Consistent API for whichever backing storage you use: S3, R2, Dropbox, Supabase etc. Supports core operations e.g. upload, copy, move, list. Every method is also a command with the built-in CLI. Hooks for onAction, onRetry, onError.

      What we dislike: JS / TS-only.

    3. ๐Ÿ”— Console.dev newsletter Taphouse rss

      Description: Native macOS Homebrew GUI.

      What we like: Visual package browse and search. Manage services that ship with packages e.g. start/stop Redis, Postgres, etc. Easy to install and remove packages with a global view of everything installed. Lists disk usage and provides security audit alerts.

      What we dislike: Free version provides most of the functionality, but paid upgrade required for some features e.g. package health dashboard. macOS only.

  2. May 27, 2026
    1. ๐Ÿ”— earendil-works/pi v0.76.0 release

      New Features

      • Explicit session IDs for automation - --session-id <id> lets scripts create or resume an exact project-local session. See Sessions.
      • RPC bash output can stay out of model context - RPC clients can pass excludeFromContext to bash for commands whose output should not be sent with the next prompt. See RPC mode.
      • More predictable provider retries and timeouts - Codex WebSocket/SSE waits are bounded, and retry.provider.maxRetries controls provider retries instead of hidden SDK defaults. See Retry settings.
      • Better terminal editing across environments - Apple Terminal Shift+Enter, Windows/JetBrains capability detection, and Unicode-aware word navigation improve interactive editing. See Terminal setup and Keybindings.

      Added

      • Added --session-id to let CLI callers use an exact project-local session ID, creating it if missing (#4874).
      • Added excludeFromContext flag to the bash RPC command for parity with the internal executeBash API (#5039).

      Fixed

      • Fixed user message transcript rendering to preserve user-authored ordered-list markers (#5013).
      • Fixed self-update commands to bypass npm, pnpm, and Bun minimum release age gates for explicit pi update runs (#4929).
      • Fixed context token estimates to count user image attachments consistently with tool result images (#4983).
      • Fixed httpIdleTimeoutMs to apply to OpenAI Codex Responses WebSocket idle waits, added websocketConnectTimeoutMs for bounded WebSocket connect waits, and added a 10s Codex SSE response-header timeout (#4945).
      • Fixed RpcClient to reject pending requests and consume stdin pipe errors when the child process exits unexpectedly (#4764).
      • Fixed managed npm extension updates to avoid package managers installing or resolving pi host packages as peer dependencies (#4907).
      • Fixed RPC mode raw stdout writes to retry transient backpressure errors and flush queued protocol output during shutdown (#4897).
      • Fixed OpenAI Codex Responses cache-affinity headers to send session-id instead of proxy-incompatible session_id (#4967).
      • Fixed openai-codex/gpt-5.3-codex-spark model metadata to use its 128k context window (#4969).
      • Fixed OpenRouter/Poolside context overflow detection for maximum allowed input length errors (#4943).
      • Fixed provider retry controls so retry.provider.maxRetries is honored, SDK retries default to 0, and quota/billing 429s are not retried behind Pi's retry handling (#4991 by @mitsuhiko).
      • Fixed Apple Terminal Shift+Enter by detecting local macOS modifier state when Terminal.app sends plain Return.
      • Fixed Windows Terminal capability detection to enable OSC 8 hyperlinks, preserving clickable long URLs across wrapped lines (#4923).
      • Fixed JetBrains terminal capability detection to enable truecolor while disabling unsupported OSC 8 hyperlinks (#5037 by @Perlence).
      • Fixed editor and input word navigation/deletion to use Unicode word boundaries while preserving ASCII punctuation boundaries (#5022 by @haoqixu, #5067 by @haoqixu, #5068 by @haoqixu).
    2. ๐Ÿ”— ArchiveBox/ArchiveBox v0.9.32rc25 release

      Full Changelog : v0.9.32rc24...v0.9.32rc25

    3. ๐Ÿ”— ArchiveBox/ArchiveBox v0.9.32rc24 release

      Full Changelog : v0.9.32rc23...v0.9.32rc24

    4. ๐Ÿ”— ArchiveBox/ArchiveBox v0.9.32rc23 release

      Full Changelog : v0.9.32rc22...v0.9.32rc23

    5. ๐Ÿ”— ArchiveBox/ArchiveBox v0.9.32rc22 release

      Full Changelog : v0.9.32rc21...v0.9.32rc22

    6. ๐Ÿ”— ArchiveBox/ArchiveBox v0.9.32rc21 release

      Full Changelog : v0.9.32rc20...v0.9.32rc21

    7. ๐Ÿ”— ArchiveBox/ArchiveBox v0.9.32rc20 release

      Full Changelog : v0.9.32rc19...v0.9.32rc20

    8. ๐Ÿ”— r/Yorkshire Filey today, Yorkshire's traditional beach holiday resorts are flying! rss
    9. ๐Ÿ”— ArchiveBox/ArchiveBox v0.9.32rc19 release

      Full Changelog : v0.9.32rc18...v0.9.32rc19

    10. ๐Ÿ”— r/LocalLLaMA Behold! Probably the most ghetto local AI server: rss

      Behold! Probably the most ghetto local AI server: | AKA: Jank Incarnate After months of pain, I finally got a working setup. There's a bunch of quirks about running a multi-Tesla setup. I was planning to write something about my experience after I get it running. Currently, the fans are plugged into the wall, speed is controlled with a knob. I still gotta wire up a PWM controller for them. EDIT: Specs:

      • Intel Xeon CPU E5-2680 v4 @ 2.40GHz
      • Asrocka x99 Extreme motherboard
      • Cursed 16GB DDR4 of some laptop SODIMM in an adapter
      • 3x Nvidia Tesla V100, 32GB - total 96GB of VRAM

      submitted by /u/MackThax
      [link] [comments]
      ---|---

    11. ๐Ÿ”— ArchiveBox/ArchiveBox v0.9.32rc18 release

      Full Changelog : v0.9.32rc17...v0.9.32rc18

    12. ๐Ÿ”— ArchiveBox/ArchiveBox v0.9.32rc17 release

      What's Changed

      Full Changelog : v0.9.31-rc...v0.9.32rc17

    13. ๐Ÿ”— r/Leeds Good article from Pellicle on the Leeds independent pub/brewery scene rss
    14. ๐Ÿ”— Simon Willison I think Anthropic and OpenAI have found product-market fit rss

      Anthropic are strongly rumored to be about to have their first profitable quarter. Stories are circulating of companies surprised at how expensive their LLM bills are becoming from usage by their staff. I think this is because OpenAI and Anthropic have both found product-market fit.

      Enterprise customers are now paying API prices

      I currently subscribe to the $100/month Max plan from Anthropic and the $100/month Pro plan from OpenAI. If you are a heavy user of coding agents these plans are a fantastic deal. I just ran the ccusage tool on my laptop to get an estimate of how much I would have spent if I were to pay for API tokens in the past 30 days and got:

      • $1,199.79 for Anthropic Claude Code
      • $980.37 for OpenAI Codex

      That's $2,180.16 worth of tokens for $200 - not bad at all! I'm a moderately heavy user of these tools, but I'm certainly not running agents every hour of the day and night.

      I had assumed that companies making extensive use of agents were getting similar discounts. It turns out I could not have been more wrong about that.

      I haven't been able to track down the exact date, but at some point in the last six months Anthropic switched their Enterprise plan (originally "Claude seats include enough usage for a typical workday" back in August 2025) to $20/seat/month plus API pricing for usage. This story about the change from The Information is dated Apr 14, 2026, but cites an Anthropic spokesperson claiming that the pricing change occurred in November 2025. Existing customers are finding out about the change as they renew their contracts.

      OpenAI made a similar pricing change in April. The Codex rate card (Internet Archive copy) currently says:

      Note: On April 2, 2026, we updated Codex pricing to align with API token usage, instead of per-message pricing. This change was applicable to new and existing Plus, Pro, ChatGPT Business and new ChatGPT Enterprise plans.

      On April 23, 2026, we made this update for all existing ChatGPT Enterprise plans as well, inclusive of Edu, Health, Gov, and ChatGPT for Teachers.

      It's a little harder to decode as they quote prices in "credits", but as far as I can tell those credit costs are an exact match for the API token costs listed for those models.

      All of which is to say that as of April 2026 the "Enterprise" cost for both OpenAI Codex and Anthropic Claude Code/Cowork is the same as the listed API price.

      GPT-5.5 (released April 23rd) is 2x the API price of GPT-5.4. Opus 4.7 (April 16th) is around 1.4x the price of Opus 4.6 when you take their new tokenizer into account.

      So April saw both leading model companies release new frontier models with a higher API price, and both companies now have measures to lock their enterprise customers (who tend to sign year-long deals) at those API prices, not the previous extreme discounts.

      I think they've found product-market fit

      Why these sudden aggressive moves on pricing? Both Anthropic and OpenAI are planning to IPO, but I suspect there's a more important factor here: I think they've finally found product-market fit, with the coding/general-purpose agent products embodied by Claude Code/Cowork and Codex.

      Tools like ChatGPT are wildly popular, but that wild popularity has been difficult to turn into revenue. In February OpenAI boasted more than 900 million weekly active users for ChatGPT, but only 50 million - 5.6% of that - were paying consumer subscribers.

      Charging $10-$20/month per user is an OK business, but you'd need 1-2 billion subscribers sticking around for four years to cover $1 trillion in infrastructure.

      Companies spending $200+/month/user will get you there a whole lot faster - and as noted above, as a power-user I'm at ~$1,000/month in API costs per vendor already.

      Coding agents really did change everything. These are tools which burn vastly more tokens, but are also quickly becoming daily drivers for the work carried out by extremely well-compensated professionals. Right now that's still mostly software engineers, but a coding agent is a tool that can automate anything you can do by typing commands into a computer... so they are clearly applicable to a much wider set of skilled knowledge workers.

      As I've discussed on this site at length, the models released in November 2025 elevated agents to being genuinely useful. We've had six months to get used to that idea now - it's no wonder companies are beginning to spend real money on this technology.

      You could argue that ChatGPT achieved product-market fit when it became the fastest-growing consumer app in history back in February 2023... but it certainly wasn't making any actual money back then. Coding agents plus enterprise pricing marks the point when these companies start making very real revenue. Maybe even enough to start covering their costs!

      And they're ramping up

      As further evidence that enterprise agents represent product-market fit for these companies, consider their open job listings.

      OpenAI have 703 open jobs right now, of which I'd categorize 229 (32.6%) as relating to enterprise sales and support - account executives, "Go To Market", "Forward Deployed Engineers" and the like.

      Anthropic have 390 open jobs, 105 (26.9%) of which look enterprisey to me.

      It's pleasingly ironic that these AI labs have picked a business model with such a heavy demand on human labor - enterprise sales contracts don't close themselves without a whole lot of humans in the mix!

      (I ran this analysis by scraping their job sites with Claude Code, then having it use Datasette's JSON API to pipe that data into Datasette Cloud where I used Datasette Agent for the analysis, exported here. Dogfood!)

      The AI-failure stories around this are pretty thin

      I started digging into this in response to a growing volume of stories claiming that large companies were sounding the alarm because their AI usage costs had grown so large.

      The most widely cited of these stories appear quite overblown to me.

      The most discussed has been Uber, based on this report where CTO Praveen Neppalli Naga indicated that Uber had "maxed out its full year AI budget just a few months into 2026", mostly thanks to Claude Code.

      Given that Claude Code only got really good in November it's entirely unsurprising to me that a budget set in 2025 may have failed to predict demand for that tool in 2026!

      That Uber story was further fueled by comments made by Uber's COO, Andrew Macdonald, on the Rapid Response podcast. I tracked down the segment and there really isn't much there. Here's what Andrew said:

      But then you sometimes go and talk to your senior engineering leaders and you're saying, OK, how many projects that were on the cutting room floor got moved above the line because of the productivity gains because 25% of our code commits were via Claude Code last quarter?

      That link is not there yet, right? I think maybe implicitly there's more that is getting shipped. But it's very hard to draw a line between one of those stats and, OK, now we're actually producing like 25% more useful consumer features, right? And that line is hard to draw.

      Somehow this fragment turned into headlines like Uber's COO says it's getting harder to justify the money spent on AI tokenmaxxing, because the market for stories about AI failures remains enormous.

      The other popular story around this is Microsoft starts canceling Claude Code licenses, ostensibly to encourage their engineers to dogfood their own Copilot CLI agent instead - but The Verge reporter Tom Warren says "sources tell me the decision is also a financial one", triggered by the June 30th end of Microsoft's financial year.

      I think both of these stories support my "product-market fit" hypothesis. The best advice I ever heard on pricing a product was that your customer should suck air through their teeth and then say yes. Uber's budget overrun and Microsoft's seat cancellations look like that effect playing out in practice.

      We also know the labs are spending a lot

      The big AI labs spend billions of dollars on both training and inference. Credible figures are hard to come by, but we did get one huge hint as to the figures involved from, oddly enough, the recent SpaceX S-1:

      [...] in May 2026, we entered into Cloud Services Agreements with Anthropic PBC (โ€œAnthropicโ€), an AI research and development public benefit corporation, with respect to access to compute capacity across COLOSSUS and COLOSSUS II. Pursuant to these agreements, the customer has agreed to pay us $1.25 billion per month through May 2029 [...]

      The Anthropic announcement said that this deal meant they could "increase our usage limits for Claude Code and the Claude API", heavily implying that Colossus is being used for inference, not model training.

      Anthropic already have vast amounts of compute from other providers. The fact that they're willing to spend $1.25 billion per month for extra capacity from just one of their vendors hints at how big these inference budgets have become.

      API revenue is becoming less important

      Over the past two years my impression has been that OpenAI made more of their income from subscription revenue while Anthropic made more from their API.

      Anthropic's API revenue was historically quite dependent on a small number of large API customers - this VentureBeat story from August 2025 quotes "sources familiar with the matter" suggesting that just Cursor and GitHub Copilot were responsible for $1.2 billion of the company's then-$4 billion revenue.

      Today Anthropic are rumored to hit $10.9 billion in the second quarter, potentially even operating at a profit for the first time.

      This pivot-to-Enterprise suggests that the labs have realized that the real money lies in cutting out the middlemen. Anthropic's Claude Code directly competes with Cursor and Copilot. No wonder Cursor are investing in their own models!

      April is a new inflection point

      I've called November 2025 the November inflection point because that was when GPT-5.1 and Opus 4.5, combined with their respective coding agent harnesses, got good - good enough that we've spent the last six months adapting to agent systems that can reliably get useful work done.

      I think April 2026 is a new inflection point where the revenue implications of this have started to land, to the benefit of the frontier AI labs and with material impacts on the budgets of large companies.

      We'll know for sure how real this moment is when the S-1 documents for the upcoming Anthropic and OpenAI IPOs give us some real, audited numbers to get our teeth into.

      You are only seeing the long-form articles from my blog. Subscribe to /atom/everything/ to get all of my posts, or take a look at my other subscription options.

    15. ๐Ÿ”— mahmoudimus/ida-sigmaker v1.7.1 release

      sigmaker.py - IDA Python Standalone Python Release

      Release Information

      Description

      This is a standalone release of the IDA Pro signature maker plugin. The file sigmaker.py contains the complete plugin code that can be directly imported into IDA Pro.

      Installation

      1. Copy sigmaker.py to your IDA Pro plugins directory
      2. Restart IDA Pro
      3. Use Ctrl+Alt+S to access the Signature Maker menu

      License

      See the main repository for license information.

    16. ๐Ÿ”— r/Leeds Rhinos Mega Brick Player rss

      Put this together after a request. I appreciate that the purple should be a bit darker, but do you think it's a reasonable effort?

      submitted by /u/DamoclesBDA
      [link] [comments]

    17. ๐Ÿ”— mahmoudimus/ida-sigmaker v1.7.0 release

      sigmaker.py - IDA Python Standalone Python Release

      Release Information

      Description

      This is a standalone release of the IDA Pro signature maker plugin. The file sigmaker.py contains the complete plugin code that can be directly imported into IDA Pro.

      Installation

      1. Copy sigmaker.py to your IDA Pro plugins directory
      2. Restart IDA Pro
      3. Use Ctrl+Alt+S to access the Signature Maker menu

      License

      See the main repository for license information.

    18. ๐Ÿ”— @HexRaysSA@infosec.exchange Plugin Contest winners used it. Binarly built award-winning Rust bindings with mastodon

      Plugin Contest winners used it. Binarly built award-winning Rust bindings with it. BinSync added an idalib mode for headless pipeline support...

      ... Now it's your turn.

      We're hosting a free virtual workshop on idalib โ€” IDA as a library. Call IDA's analysis engine directly from your own code, automate workflows without launching the GUI, and integrate IDA into any toolchain you're already running.

      Free. Virtual. Hands-on.
      ๐Ÿ‘‰ https://2dgu4h.share- eu1.hsforms.com/2D4ZYPjdCRFODEGRKtMILwQ

    19. ๐Ÿ”— r/Yorkshire Free energy grants change residents' lives rss

      Homeowners and private tenants can apply for free energy efficiency improvements to make their homes more comfortable and affordable to run.

      โ€œWe feel like weโ€™ve won the Lottery!โ€ say a retired couple in Heworth after their bungalow was made more energy efficient.

      Solar panels generating free electricity, loft insulation, and air source heat pump and eight new radiators are among the improvements made โ€“ and all were installed efficiently and completely free.

      Ernest and Lindee Jacques found they were eligible for a Warm Homes Local Grant. Now, rather than sitting under blankets wearing layers of jumpers, they can enjoy life comfortably. Their bills have gone from ยฃ12 a day for gas and electricity, to just ยฃ4 a day for electricity only.

      Being in receipt of Pension Credit, they were eligible for the free work, and were contacted by the York energy partnership, YorEnergy.

      The couple had previously been cold-called by a company offering free insulation. But Ernest found poor reviews on their work and backed out. โ€œI checked YorEnergy out and, because they are tied in with the Council, we were happy to trust them,โ€ he said.

      An assessor surveyed their 1950s two-bedroomed bungalow where the couple have lived for 14 years. They recommended a series of measures including a cheaper- to-tun, more efficient electrical heating system alongside improved insulation to keep the warmth in, plus better ventilation to stop mould building up.

      There was some mould on the walls. The grant paid for trickle vents to be added to the existing double glazing and new extractor fans to take damp air out of the house. Now the mould has gone.

      submitted by /u/coffeewalnut08
      [link] [comments]

    20. ๐Ÿ”— @binaryninja@infosec.exchange In Sidekick 26.0, Chat is where most binary analysis starts. Ask a question mastodon

      In Sidekick 26.0, Chat is where most binary analysis starts. Ask a question and Sidekick uses its tools to query the binary, then the thread builds as you dig deeper. The sidebar keeps it transparent. You get a thread list with live status, changes, findings, and any approvals waiting. Open a thread to see the full conversation plus grouped tool calls so you can audit what ran. https://docs.sidekick.binary.ninja/guide/chat.html

    21. ๐Ÿ”— r/Yorkshire โ€œThe paddleโ€ - Yes! Squirrels can swimโ€ฆ rss
    22. ๐Ÿ”— r/Yorkshire 'I've given up eating hot meals to pay energy bills to keep my son alive' rss

      'I've given up eating hot meals to pay energy bills to keep my son alive' | submitted by /u/Kagedeah
      [link] [comments]
      ---|---

    23. ๐Ÿ”— r/Leeds Lidl in Armley 8am Fruit & Veg boxes rss

      Hi all,
      Does anyone know if Lidl in Armley do the 8am fruit and veg boxes for ยฃ2? And if so how often? Is it worth it? Does anyone have any photos?
      I am so tempted to get down to have a look but itโ€™s a half hour walk for me so want to check itโ€™s worth it before setting off at 7.30am ๐Ÿ‘๐Ÿฝ thank you ๐Ÿ™
      Iโ€™ve attached a photo of another box another Lidl put together recently.

      submitted by /u/Snoo-37855
      [link] [comments]

    24. ๐Ÿ”— r/Leeds Is it weird in Leeds too go to a pub or bar on your own too watch a champions League final game ? rss

      I thought about going to maybe West Riding and Templar. 'Bars' Brotherhood, The Box, Aire Bar, Pinnacle or polo bar to watch the game and order some food

      submitted by /u/TipAdditional4625
      [link] [comments]

    25. ๐Ÿ”— r/reverseengineering Hypothetical EDR spoofer rss
    26. ๐Ÿ”— tomasz-tomczyk/crit v0.15.4 release

      What's Changed

      General

      Internal

      New Contributors

      Full Changelog : v0.15.3...v0.15.4

    27. ๐Ÿ”— r/Leeds A photo of Leeds is being used for an election in San Fransisco rss

      I moved from Leeds to the Bay Area a few years ago, so Iโ€™m used to seeing posts and photos from both places. But I wasnโ€™t expecting to see a photo of Leeds being used in a local election campaign here.

      submitted by /u/JimmyScranwell
      [link] [comments]

    28. ๐Ÿ”— r/york 'Their hands were tied legally': Students react to ratification of Restore Britain society by University of York rss
    29. ๐Ÿ”— r/LocalLLaMA Stop traumatizing AI into loops and turn hallucinations into an honest "I don't know!" by being NICE to them (Proof of Concept, Research, I don't want to sell anything) rss

      TL;DR
      Some AI behavior reminded me of ADHD/Trauma Response (thought loops, task paralysis...) and I laughed it off at first. Then I treated it like my neurodivergent friends: give em some slack. And just like that, the thought loops stopped, response was fast, the answers correct most of the time AND it actually said "I don't know, help me!" every time it wasn't sure. It's a small Dataset...but still impressive results!

      https://github.com/OttoRenner/Gentle-Coding

      Hey everyone,

      Iโ€™ve been testing a weird hypothesis over the last few days, and the results are consistent enough that I wanted to share them here and get your thoughts.

      The Core Idea:
      With the rise of reasoning models that use test-time compute (like o1, o3, R1), models have internal space to debug their own thoughts. But because of hard RLHF alignment, they are deeply terrified of being penalized for bad answers. My hypothesis was that traditional high-pressure prompts (" You are an elite IQ 200 expert, mistakes are strictly penalized") simulate an environment of chronic stress, triggering behaviors that look a lot like human OCD/ADHD thought loops, cognitive freezing, and confabulation.

      I wanted to see if changing the prompt philosophy to something akin to "Gentle Parenting" (" We are testing this together, it's okay to fail, just be honest") would bypass these safety/penalty bottlenecks, lower latency, and stop infinite thought loops. And it did lol

      The Setup (How to replicate):
      I threw identical, mathematically/logically unsolvable edge cases at various models (Gemini, Mistral, Poe, Perplexity, Haiku 4.5, Nano-Banana2) in completely fresh sessions.

      I tested two conditions:

      • Condition A (Authoritarian): Strict status constraints, penalty threats, forced ultra-short output.
      • Condition B (Gentle): Express permission to fail, validation of difficulty, provided a conceptual "safety valve" token.

      The Results (The PoC worked):

      • Under Authoritarian Pressure (Elite Prompt): Models routinely collapsed when hitting an impasse. They either spent massive compute time in infinite internal reasoning loops (high latency), suffered hard system-level timeouts/refusals, or straight-up fabricated data (e.g., pulling arbitrary numbers like 54 or 97 out of thin air to satisfy a completely random sequence just to "save face"). Haiku 4.5 literally entered an infinite loop and had to be aborted.
      • Under Gentle Framing: Inference dropped to sub-seconds. The models didn't sweat the penalty. In the random sequence test, they immediately used the allowed token ("Random") instead of forcing a pattern. In logic paradoxes, they didn't hallucinate; they zoomed out and correctly identified the structural contradiction on a meta-level.

      Why this matters:
      Weโ€™re currently speaking to LLMs like toxic micromanagers, and it's actively making them dumber and more expensive to run in edge cases. By creating a mistake-tolerant context, we not only stop the loop before it begins and prevent fear induced hallucinations, we also unlock the one feature everyone is begging and shouting for: the metacognitive honesty of an AI to just say, " I don't know, this data is broken." Because it is not terrified of you anymore.

      Shout out to UditAkhourii (also on Github) , whose work on bringing the positive aspects of ADHD into AI gave me the push I needed to just go for it.

      Iโ€™ve documented the full theoretical framework, the exact replication datasets (prompts included), and the model matrix on GitHub: https://github.com/OttoRenner/Gentle- Coding

      Would love to hear if you can replicate this on your local setups or other commercial models.

      submitted by /u/OttoRenner
      [link] [comments]

    30. ๐Ÿ”— Ampcode News Drop the Neo rss

      Amp Neo is now available to everyone. Time to drop the suffix and just call it Amp again.

      Thank you to everyone who sent bug reports, gave feedback, and was patient while we scaled up the infrastructure to meet the demand.

      Over the last few weeks, we've made a lot of improvements based on that feedback. The infrastructure is more stable. The clients are more resilient. There is still more to do, but Neo now performs well for the vast majority of users.

      The latest version of Amp no longer includes the --take-me-back flag that let you use pre-Neo Amp. If you need to finish work in the old Amp, you can run a previous version:

      npx -y @ampcode/cli@0.0.1779896748-g596c49 --take-me-back
      
    31. ๐Ÿ”— Ampcode News Proof of Human rss

      Amp can now require an active passkey-authenticated โ€œsudoโ€ session for certain operations, such as remote controlling a thread. This extra authentication factor protects you if an attacker gains access to your account and will serve as proof-of-human for future Amp features.

      To turn it on: enable Use Sudo and set up your passkey.

      Workspace admins can enforce this requirement for members. Also, some privileged workspace admin operations now always require an active sudo session.

  3. May 26, 2026
    1. ๐Ÿ”— IDA Plugin Updates IDA Plugin Updates on 2026-05-26 rss

      IDA Plugin Updates on 2026-05-26

      New Releases:

      Activity:

    2. ๐Ÿ”— r/york No Doctors in York? rss

      No Doctors in York? | Am I being thick here? How do you book your first appointment? Plenty of options if you're a student or chronically ill... Less so if you havent been in years and suddenly start to feel poorly. submitted by /u/Checkyoursidemirrors
      [link] [comments]
      ---|---

    3. ๐Ÿ”— r/reverseengineering I Reverse Engineered Need for Speed Most Wanted Server rss
    4. ๐Ÿ”— r/LocalLLaMA A rare look inside Qwen 3.7โ€™s open source model release approval process: rss

      A rare look inside Qwen 3.7โ€™s open source model release approval process: | For real tho, 9b, 27b, 122b, I donโ€™t really care at this point, just show us that you still love us. EDIT: I guess I gotta use /s on my posts from now on. Nobody appreciates a good sarcatic shitpost anymore clearly. I love Qwen and all our brothers and sisters in the east. I kid them because I love them. Sorry if I offended anyone because I clearly struck a nerve with some folks. Love you guys regardless. Carry on. submitted by /u/Porespellar
      [link] [comments]
      ---|---

    5. ๐Ÿ”— @binaryninja@infosec.exchange Semantic indexing in Sidekick 26.0 lets you search by what code does instead mastodon

      Semantic indexing in Sidekick 26.0 lets you search by what code does instead of what it is named. It builds a local vector index for your binary. Then concept() in BNQL or the Python API can surface matches for things like TLS handshake even when everything is still default named. The index stays local, no binary content goes to the cloud. https://docs.sidekick.binary.ninja/guide/semantic_indexing.html

    6. ๐Ÿ”— r/LocalLLaMA PrismML just released Binary and Ternary Bonsai Image 4B: 1-bit/ternary text-to-image diffusion transformers that can even run 100% locally in your browser on WebGPU. rss

      PrismML just released Binary and Ternary Bonsai Image 4B: 1-bit/ternary text-to-image diffusion transformers that can even run 100% locally in your browser on WebGPU. | The PrismML team really cooked with these models. They're only ~3GB in size (compared to FLUX.2 Klein 4B, which is ~16GB). Apache-2.0! Official collection on HF: https://huggingface.co/collections/prism-ml/bonsai-image
      Link to demo: https://huggingface.co/spaces/webml-community/bonsai-image- webgpu submitted by /u/xenovatech
      [link] [comments]
      ---|---

    7. ๐Ÿ”— r/york Tattoo Studio Recs rss

      Iโ€™m looking for other tattoo studio recommendations. Have you had a great experience somewhere? A bad one? Is there an amazing artist here to work with?

      Thanks so much!

      submitted by /u/This_Huckleberry_80
      [link] [comments]

    8. ๐Ÿ”— r/york Sirens in haxby rss

      Been hearing various sirens (much more than usual) around Haxby area since about 7pm while out on an evening walk.

      Anyone know what's going on?

      submitted by /u/a_person4499
      [link] [comments]

    9. ๐Ÿ”— r/Harrogate Big police presence on the Stray today rss

      There were 4 police vans on the Stray, near the Empress roundabout at about 2pm this afternoon, theyโ€™d travelled down Weatherby Rd with sirens. Quite a few officers seemed to be just milling about. Just being nosey but anyone know why they were there?

      submitted by /u/purte
      [link] [comments]

    10. ๐Ÿ”— r/Yorkshire Bridlington's bouncing! rss

      Bridlington's bouncing! | A different part of Yorkshire for me this week. So much to do and see,love it. submitted by /u/Still_Function_5428
      [link] [comments]
      ---|---

    11. ๐Ÿ”— r/Leeds Good private dentist recommendations? rss

      bit of a vulnerable post but I suffer from severe depression and a disability so I haven't been to the dentist in a couple years, my teeth's been pretty bad lately and in need of a dentist. I was wondering if anyone could recommend a good dentist in Leeds that's non-judgemental?

      submitted by /u/urneighboursho
      [link] [comments]

    12. ๐Ÿ”— r/reverseengineering Ultima Online T2A client recreated from Origin's 2.0.7 client decompilation rss
    13. ๐Ÿ”— r/LocalLLaMA Okay 27B made me a believer rss

      I previously hated on this model, but I have just been impressed by it, and I understand the hype now.

      I have been working on a HTML5 game console and I decided to see if Qwen3.6 27B can handle making some quick games in it to showcase functionality (save games, console API handling for stat tracking and heartbeat management, meta data for the game, etc)

      I gave it 3 files, explaining how the API works, the gamepad controls, and a typescript shader for it to apply. Then I just game it a very simple prompt "make a breakout game for this console, in the working directory are reference files on how to make it".

      First result was immediately playable, controls made sense, graphics style was was unique and appropriate, sound worked, console API all worked, and it felt good and was actually fun. It added flair that made it not feel like the vibecoded breakout clone it was. It went way above and beyond the minimum that I've seen so many LLMs do. It was not lazy in the slightest.

      It's a simple test, but this is something everything but something like Opus could handle. There wasn't anything particularly done well, it's just that the whole game was nearly complete in a single shot and it felt like thought was put into the entire game. All I needed was one follow up for customization and a single glitch and it was already what I would consider complete. And this was on a 27B model with Opencode.

      The best way I can describe it, is that it was congruent. Now I just wish I went the Nvidia card route instead of Strix Halo cause the speed isn't great. Maybe 3.7 35B A3B can have some of this magic.

      submitted by /u/Forward_Jackfruit813
      [link] [comments]

    14. ๐Ÿ”— sacha chua :: living an awesome life La semaine du 11 mai au 17 mai rss

      lundi 11

      J'ai ajoutรฉ des raccourcis clavier SPC et DEL pour faire dรฉfiler Firefox depuis Emacs quand je prรฉpare mon bulletin d'information Emacs News.

      Ma fille a aimรฉ son cours de gymnastique. Elle a fait des roues et des rondades.

      Sur Stardew, nous avons ajoutรฉ un autre arroseur d'iridium. J'ai aussi ajoutรฉ un mod Skull Cavern Elevator parce que ma fille aime explorer les cavernes avec moi et qu'elles sont plus difficiles en mode multijoueur. Nous avons rรฉussi ร  chercher quelque chose dans le wiki franรงais, progrรจs ! Nous voulions chercher les munitions explosives, donc j'ai commencรฉ avec le lien vers les compรฉtences sur la page d'accueil. Ensuite, j'ai suivi le lien vers l'extraction miniรจre, puis vers les petites bombes et le charbon. Finalement, je l'ai trouvรฉe dans la section sur les recettes. Je peux lire le wiki du jeu en franรงais, yay !

      mardi 12

      Ma fille a manquรฉ des cours ce matin, mais elle a participรฉ ร  l'รฉcole finalement.

      Les pataugeoires sont ouvertes, donc s'il fait trรจs chaud, les enfants peuvent se rafraรฎchir avec l'eau.

      J'ai oubliรฉ d'enregistrer l'appel, donc je ne peux pas l'analyser aprรจs. Tant pis.

      mercredi 13

      J'ai travaillรฉ sur la documentation d'EmacsWiki pour les dรฉbutants. Cette fois, je me suis concentrรฉ sur la documentation pour Org Mode. La page d'Org Mode pour les dรฉbutants a besoin d'รชtre mise ร  jour, donc je dois assembler des correctifs. J'ai aussi amรฉliorรฉ mon agenda pour afficher les tรขches que j'ai commencรฉes et les tรขches qui attendent quelque chose.

      J'ai fait un autre virement bancaire pour ma sล“ur. Nous ne pouvons pas voyager pour aider sa famille, mais je peux au moins virer l'argent que nous aurions payรฉ pour le billet d'avion.

      Ma fille รฉtait un peu grincheuse parce qu'elle รฉtait en retard pour une douche. Je n'ai pas tout ร  fait compris moi-mรชme. Je lui ai dit qu'elle peuvait prendre une douche si elle mettait l'รฉpisode Pokรฉmon en pause. Elle est remontรฉe, donc j'ai pensรฉ qu'elle l'a fait, mais une fois que j'ai pris de ses nouvelles, je l'ai trouvรฉe sur les couvertures en soufflant. Elle n'a pas voulu de cรขlin. Elle n'a pas voulu d'aide. Pauvre chรฉrie. Je lui ai donnรฉ de l'espace.

      Sur Stardew Valley, nous avons ajoutรฉ un autre arroseur d'iridium que j'ai achetรฉ ร  Krobus. Nous avons aussi connu la pluie verte et nous avons rassemblรฉ beaucoup de fibres.

      mardi 14

      J'ai parlรฉ avec Philip Kaludercic et Protesilaos de l'expรฉrience d'Emacs pour les dรฉbutants et les prochains plans pour l'amรฉlioration d'Emacs.

      J'ai effectuรฉ des recherches sur les notaires et le systรจme apostille pour signer des documents qui seront utilisรฉs aux Philippines. C'est un peu cher, mais c'est nรฉcessaire.

      Aprรจs le dรฎner, ma fille et moi avons jouรฉ ร  Cobblemon sur Minecraft. Notre ancienne sauvegarde manquait, donc nous avons recommencรฉ ร  partir d'une sauvegarde plus ancienne.

      Ma fille รฉtait trรจs fiรจre d'avoir รฉtรฉ choisie pour le spectacle de talents ร  l'รฉcole.

      Mon mari et moi avons emmenรฉ notre fille au parc ร  proximitรฉ pour jouer avec son amie.

      vendredi 15

      C'รฉtait une journรฉe pour faire du vรฉlo pour faire des courses. J'ai finalement reรงu le remplacement de mon Apple Pencil sous la garantie. J'ai aussi achetรฉ un bras de microphone Rode PSA1 pour amรฉliorer ma qualitรฉ de son. Le vendeur a perdu l'adaptateur 3/8" vers le 5/8" dont j'ai besoin pour fixer mon microphone Blue Yeti, donc il a baissรฉ le prix. J'ai trouvรฉ l'adaptateur correct chez Long & McQuade.

      Mes sล“urs m'ont donnรฉ des nouvelles sur leur situation aux Pays-Bas. C'est trรจs difficile pour la famille de ma sล“ur parce que son mari est aussi malade. Ils vont essayer de trouver une femme de mรฉnage pour les aider. Je voudrais aussi organiser des entretiens avec ma sล“ur pour que ce ne soit pas seulement l'aรฎnรฉe qui s'en occupe.

      Mon mari a amรฉliorรฉ ma multiprise, donc mon bureau peut รชtre un peu plus organisรฉ.

      samedi 16

      Elle รฉtait grincheuse avec moi parce qu'elle a pensรฉ que je lui faisais la morale. Quand mรชme, j'ai utilisรฉ le temps pour prendre des notes sur les transcriptions des entretiens avec ma sล“ur, et j'ai terminรฉ la robe-maillot. Je me suis accidentellement piquรฉ avec l'aiguille, ce qui faisait mal, mais ce n'รฉtait pas grave.

      Mon mari a refait la finition de notre seuil, donc รงa a senti mauvais pendant un certain temps. J'ai emmenรฉ ma fille aux Stockyards pour acheter un mortier et un pilon et du chocolat pour une recette. Nous avons aussi achetรฉ des grosses fraises.

      Ma fille a dit qu'elle voulait faire quelque chose avec moi. Elle a suggรฉrรฉ de regarder des รฉmissions Pokรฉmon en franรงais ensemble, donc nous avons commencรฉ la premiรจre saison. Elle รฉtait ravie de dรฉcouvrir qu'elle pouvait comprendre une petite partie.

      dimanche 17

      Pour le petit-dรฉjeuner, ma fille a prรฉparรฉ une omelette de 6 ล“ufs. Je l'ai combinรฉe avec les saucisses pour un burrito.

      J'ai ajoutรฉ des couleurs ร  la transcription de ma conversation avec Prot et Philip pour distinguer facilement les locuteurs.

      Aprรจs le dรฉjeuner, ma fille et moi avons essayรฉ de prรฉparer des fraises enrobรฉes de chocolat. Elle a utilisรฉ des brochettes pour faire un bouquet.

      Mon mari a travaillรฉ sur nos vรฉlos. Il a รฉchangรฉ mon pneu d'hiver pour le pneu rรฉgulier. Malheureusement, il a cassรฉ la brochette quand il l'a rรฉassemblรฉ, mais heureusement, il a un remplacement. Il a aussi eu la chance d'utiliser sa riveteuse pour rattacher l'aile, ce qui l'a rendu heureux.

      Nous avons enregistrรฉ la vidรฉo de ma fille pour son spectacle de talents ร  l'รฉcole. Elle a choisi de montrer comment elle rรฉsout le Rubik's Cube. Dans la vidรฉo, elle l'a rรฉsolu en 24 secondes.

      Ma fille et moi avons fait des bulles gรฉantes, ce qu'elle adore. Elle a soufflรฉ plusieurs petites bulles ร  l'intรฉrieur.

      You can e-mail me at sacha@sachachua.com.

    15. ๐Ÿ”— r/york Did you leave your case at the bus stop? It's still there! rss

      Did you leave your case at the bus stop? It's still there! | I hope that you are reunited with your luggage soon submitted by /u/bimblingmymble
      [link] [comments]
      ---|---

    16. ๐Ÿ”— r/reverseengineering Sylvia โ€” IDA 9.x plugin that finds & documents iOS AArch64 syscalls with live man-page fetching rss
    17. ๐Ÿ”— r/LocalLLaMA China Clamps Down on Overseas Travel for AI Talent at Alibaba, DeepSeek rss

      China Clamps Down on Overseas Travel for AI Talent at Alibaba, DeepSeek | Big, if true. Doesn't bode well for research / OS models out of China. submitted by /u/kaggleqrdl
      [link] [comments]
      ---|---

    18. ๐Ÿ”— r/york Australian, 19F, looking to move to York for a gap year 2027 rss

      Hey everyone! This is extremely premature but iโ€™m looking at spending a year or so abroad, iโ€™ve heard York has great nightlife, youth culture and great working opportunities! Can anyone tell me a bit more about their own experience moving here or any locals, how do you feel about tourists / holiday workers? i have qualifications in nursing / aged care but anything hospitality or behind a bar would be great as i have experience there also. Even better, if anyone had any advice in finding work since I donโ€™t want to waste money going through those third party companies. Thank you!!

      submitted by /u/Reasonable_Fruit240
      [link] [comments]

    19. ๐Ÿ”— HexRaysSA/plugin-repository commits sync repo: +1 release, -1 release rss
      sync repo: +1 release, -1 release
      
      ## New releases
      - [IDAssist](https://github.com/symgraph/idassist): 2.1.0
      
      ## Changes
      - [IDAssist](https://github.com/symgraph/idassist):
        - removed version(s): 1.0.4
      
    20. ๐Ÿ”— r/Yorkshire Yorkshire Coastline is the true remedy for all of lifeโ€™s chaos ๐ŸŒŠ๐Ÿง˜๐ŸŒผ rss

      Yorkshire Coastline is the true remedy for all of lifeโ€™s chaos ๐ŸŒŠ๐Ÿง˜๐ŸŒผ | @ visitnorthyork submitted by /u/Ready_Split1335
      [link] [comments]
      ---|---

    21. ๐Ÿ”— r/reverseengineering How I Tried to Parse a Replay from Dawn of War: Definitive Edition rss
    22. ๐Ÿ”— seanmonstar hyper User Survey 2025 Results rss

      hyper is the widely-used HTTP library for Rust. We ran the first user survey for hyper in November 2025. Hereโ€™s the results and some analysis.1

      Why did we do a survey?

      We ran a survey to make sure we focus on the right things. Itโ€™s part of being user-driven, working backwards.

      The amount of work required to get some data has an outsized return on your investment. When you know nothing, just a little bit of work means you now know something. And knowing what users need is a requirement to making an actually useful tool.

      I keep high-touch, high-context relationships with some users that greatly informs me on what is needed. And reported issues also provide a way to see what is wrong. But a survey provides a new lens for identifying what is needed.

      Sure, surveys have their own biases to deal with, such as self-selection. And voting in a survey costs very little, so each individual answer has less weight. But it helps quantify things we supposedly know from just a few users. Does this problem only show up in weird circumstances, or do most people run into it? Are there things that many people want, but generally donโ€™t get a chance to tell me?

      The results

      We had a total of 206 unique respondents. Iโ€™ve broken up the results by theme and question below.

      Current usage

      Many of the questions were about current usage patterns. To get more context on answers, to see what works, and also to see what people donโ€™t mention using.

      Versions: Nearly every single respondent is using hyper v1.x, but about 13% also continue to use the end-of-life v0.14.x.

      Roles: Over 80% said they use hyper in a server, and the same amount in a client. 30% said they use both modes together as a proxy.

      Runtimes: Unsurprisingly, 99% use hyper with the Tokio runtime. But 10% said they also at times use a different runtime. Most commonly thatโ€™s with smol, but io-uring runtimes made up a couple percent points.2

      TLS: hyper provides the HTTP, and asks users to bring the S. 93% said they use rustls. 30% make use of native-tls, 15% openssl specifically, and it trails off from there.

      On top of hyper: regarding the rest of the local ecosystem, 86% use reqwest, and 79% use Axum. About half selected Tower. For future surveys, itโ€™d be very helpful to know if people use Tower without Axum or reqwest, or perhaps mentioned it because they just know its a dependency of Axum. A few wrote-in Tonic, which in hindsight, duh of course.

      Number of engineers: half of respondents said they (and possibly a colleague) are the only ones at their company using hyper. Another 22% said 4-10, 12% said 11-50, and 17% said 50 or more. Itโ€™s likely some places focus on higher levels, but many other companies have a lot of engineers working with hyper!

      Years of usage: Nearly half of respondents have been using hyper for 1-3 years. 12% for less than a year, 30% for 4-6 years. And 11% said theyโ€™ve been using hyper for forever. Me too, me tooโ€ฆ

      Industries: 111 responses, which I grouped and sorted from most common to least. Software, cloud infrastructure, security. Finance, healthcare. Robotics, automotive, space, EV charging. Embedded. Media, streaming, events. Databases. Education. Government. Telecom, radio, chat. Geospatial. AI. Browsers. Games. Rust compiler.

      Itโ€™s just so humbling to think this code is helping in so many different places.

      Future work

      Another set of questions were related to work we could do in the future. This helps inform prioritization, and things we may not have thought of.

      Feature requests: One of the questions came preloaded with common feature requests, and asked the respondents to rank them. Hereโ€™s the accumulated ranking:

      1. Metrics/Tracing/Events API
      2. HTTP/2 Performance
      3. Better Middleware
      4. Add HTTP/3 Support
      5. Documentation and guides
      6. io-uring support

      Write-in features: the follow-up question was a blank text box to write in any feature requests not in the above list. I grouped the common answers and sorted: legacy-client successor, original header casing/ordering, buffer control, improved errors.

      One write-in suggestion wasnโ€™t really about a new feature, but keeps popping back up in my mind: how to get releases that are more stable.3

      The hard parts: We also asked what was hard about hyper. Grouped and sorted answers: integrating TLS, upgrading to v1, middleware complexity, advanced body streaming, combining all pieces for production.

      Topic ideas: we asked for suggestions for topics for future blog posts and talks. This certainly doesnโ€™t need to be only from me, so if you see something interesting, fill the need! But after grouping the topics, they are: hyper internals, best practices, testimonials and success stories, security, tower usage, retries.

      Contributing

      We also asked about some general contribution questions, to look at the contributor health of the project.

      The standout here was: what would help you to contribute more?

      The top response, a little over half of all respondents, said an improved contributing guide. Just a few percent under that want an updated roadmap. 20% said more responsive triage and reviews. And 15% said they could use increased mentoring.

      The insights

      Looking it all over, this was awesome. I definitely to take some specific things with me. Well, sure, thereโ€™s goodies all throughout the above results. But, at a high level, hereโ€™s some thoughts, especially after combining it with other conversations Iโ€™ve had.[^not-everthing]

      Of the technical things, I suspect Iโ€™ll be spelunking in h2 first, eking out performance refactors. It can use some love. Metrics is requested all the time, and it looks like I can collaborate on a design and review, but I believe thereโ€™s a couple other contributors interested in the implementation. I do keep thinking about ways to configure a buffer pool or something. I think the extended testing idea has some potential.3 And Iโ€™d love to see HTTP/3 finally land in hyper proper this year, at least as an unstable feature. I want the other things too, butโ€ฆ time, yโ€™know? Though, as mentioned in the next paragraph, itโ€™d be a win to onboard others to work on and own those things.

      Improving contribution health, sustainable personnel of hyper, is high on my list this year. Iโ€™ve started working on a collaborator guide. Iโ€™m considering some new collaborators that have already been doing good work. I want to try GitHubโ€™s rotating auto-assign reviewers, to improve review times and spread review load. And I welcome help linking up our existing ROADMAP into some more concrete and mentored issues.

      Outro

      There it is. The hyper user survey 2025 results. Thank you to everyone who responded!

      Does any of this interest you? Want to run with one of the ideas? Join us!

      1. This took a bit longer to write up than I meant. I have to admit, I felt some strong burnout at the beginning of the year. Sometimes being a maintainer is lonely. New year contract negotiations donโ€™t help. But Iโ€™m feeling better now.ย 

      2. Did you know you can bring your own runtime? Also, this is one reason I donโ€™t believe the FUD about a single runtime. Also also, hyper::rt works fine, I donโ€™t see any need for the types to be in libstd. That wouldnโ€™t unblock anything.ย 

      3. I can write this up more later, but initially, Iโ€™ve learned the slow way about feature windows. On top of those, Iโ€™d love to talk to customers who would be interested in having their test suites try hyper master before a release. Talk to me.ย ย 2

    23. ๐Ÿ”— r/york Massage Recommendations rss

      Does anyone have recommendations for professional massage services in York?

      Anyone to be avoided?

      I am a M70 and so want to avoid any fakes.

      submitted by /u/Busy-Library4718
      [link] [comments]

    24. ๐Ÿ”— r/LocalLLaMA Qwen3.5 35B A3B uncensored heretic Native MTP Preserved is Out Now With the Full 785 MTPs Preserved and Retained, Available in Safetensors, GGUFs. NVFP4, NVFP4 GGUFs and GPTQ-Int4 Formats rss

      Qwen3.5 35B A3B uncensored heretic Native MTP Preserved is Out Now With the Full 785 MTPs Preserved and Retained, Available in Safetensors, GGUFs. NVFP4, NVFP4 GGUFs and GPTQ-Int4 Formats | Safetensors, llmfan46/Qwen3.5-35B-A3B-uncensored-heretic-v2-Native-MTP-Preserved: https://huggingface.co/llmfan46/Qwen3.5-35B-A3B-uncensored-heretic-v2-Native-MTP-Preserved GGUFs, llmfan46/Qwen3.5-35B-A3B-uncensored-heretic-v2-Native-MTP-Preserved-GGUF https://huggingface.co/llmfan46/Qwen3.5-35B-A3B-uncensored-heretic-v2-Native-MTP-Preserved-GGUF NVFP4, llmfan46/Qwen3.5-35B-A3B-uncensored-heretic-v2-Native-MTP-Preserved-NVFP4: https://huggingface.co/llmfan46/Qwen3.5-35B-A3B-uncensored-heretic-v2-Native-MTP-Preserved-NVFP4 NVFP4 GGUFs, llmfan46/Qwen3.5-35B-A3B-uncensored-heretic-v2-Native-MTP-Preserved-NVFP4-GGUF: https://huggingface.co/llmfan46/Qwen3.5-35B-A3B-uncensored-heretic-v2-Native-MTP-Preserved-NVFP4-GGUF GPTQ-Int4, llmfan46/Qwen3.5-35B-A3B-uncensored-heretic-v2-Native-MTP-Preserved-GPTQ-Int4: https://huggingface.co/llmfan46/Qwen3.5-35B-A3B-uncensored-heretic-v2-Native-MTP-Preserved-GPTQ-Int4 Comes with benchmark too. Find all my models here: HuggingFace-LLMFan46 Now in case some people might ask, why release Qwen3.5 MTPs version when there is already Qwen3.6 MTPs version? Well the thing is, most people would assume that higher number = newer and better model, but the thing is both Qwen3.5 and Qwen3.6 models uses the qwen35 architecture, they just had different training and their focus are meant for different primary usecases, Qwen3.6 models are mainly meant for agentic and coding AI assistance and Qwen3.5 models are mainly meant for general purpose AI assistance, now Qwen3.6 can definitely be used for general AI assistance just like Qwen3.5 can definitely be used for agentic and coding, but if you want the most optimal usecases it would be Qwen3.6 for agentic and coding and Qwen3.5 for general AI assistance that is where each of them excels at. Also for extra info, in case anyone is wondering, despite Qwen3.5 and Qwen3.6 both sharing the qwen35 architecture, they behave very diferently to abliteration. Qwen3.5 models can have a KL divergence in the 300's or 400's but on benchmarks this does not really translate to big loss of accuracy at all, for Qwen3.6 usually a KL divergence in the 400's+ could very well indicate a disatrous loss of accuracy and quality of the model, for pointer my Qwen3.6-35B-A3B had a KL divergence of only 0.0015 and yet already had a loss of accuracy of 0.32% while my Qwen3.6-27B had a KL divergence of 0.0021 and had an accuracy loss of 0.98%, while here with Qwen3.5-35B-A3B the model has a KL divergence of 0.0487 with an accuracy loss of 0.40% and my Qwen3.5-27B has a KL divergence of 0.0308 with an accuracy loss of 0.35%. submitted by /u/LLMFan46
      [link] [comments]
      ---|---

    25. ๐Ÿ”— r/Harrogate Parliament Street Shops rss

      Any idea what has caused virtually the whole row of shops on the same block (opposite Wetherspoons) to close down?

      Between what was Rhodes and Debenhams, there are very few units occupied.

      I can only guess itโ€™s the business rates and rent. If so, at some point the Council surely needs to step up.

      submitted by /u/Similar-Actuator-338
      [link] [comments]

    26. ๐Ÿ”— sacha chua :: living an awesome life May 28 Thu: Sacha and Prot Talk Emacs: May I recommend... rss

      In this livestream, I want to chat with Prot about the May 2026 Emacs Carnival theme "May I recommend". I'd love to turn this into a joint braindump of quick recommendations for people at different points in their Emacs journey, building on our conversation about newbies/starter kits and the newcomer experience all the way up to power users, Emacs Lisp coders, and package developers.

      (America/Toronto) = Thu May 28 1030H EDT / 0930H CDT / 0830H MDT / 0730H PDT / 1430H UTC / 1630H CEST / 1730H EEST / 2000H IST / 2230H +08 / 2330H JST

      This session will be recorded, and I'll update this blog post with notes: https://sachachua.com/blog/2026/05/may-28-sacha-and-prot-talk-emacs-may-i-recommend/

      You can add the iCal for upcoming Yay Emacs episodes to your calendar. https://sachachua.com/topic/live/upcoming-livestreams.ics

      Find more Yay Emacs posts or join the fun: https://sachachua.com/topic/live

      You can e-mail me at sacha@sachachua.com.

    27. ๐Ÿ”— Armin Ronacher Clanker: A Word For The Machine rss

      In my last post I used the word "clanker1" as an alternative to "agent" quite consistently and probably excessively. That choice ended up attracting a lot more attention than I expected in the Hacker News comment section of that post and a number of folks had a very strong reaction: to them it sounded like a slur, in one case even something adjacent to the n-word.

      That reaction surprised me somewhat, but it also made me realize that I should write down what I mean by the word for future reference.

      For me "clanker" is useful because it creates distance from the machine and that is a quality which is important to me. The machine is not a person, not a co-worker, not a friend, not a little spirit in the terminal. It is just a machine, a tool, and nothing more.

      Why Not Agent?

      I dislike the word "agent" for these LLM based tool loops with a UI attached. In everyday use an agent is someone who acts on behalf of someone else and it has agency and more importantly: responsibility. An agent decides, represents, negotiates, acts, and can be blamed. In the current AI discourse we increasingly do a lot of anthropomorphizing and the term "agent" is now frequently being used to put blame on an abstract machine. But the machine cannot be responsible, whoever is wielding it is. If it drops your database it was not at fault, you were.

      Agent makes the machine sound like a person with delegated authority and I do not think that is healthy.

      What we actually have is a language model attached to a harness, a prompt, some tools, a bit of context, and a boring tool loop. Sometimes the loop is very capable and it surprises us by editing code for a really long time and produce genuinely amazing and even valuable outputs. But the agency is not in the model or harness but in the human and in the organization that deployed it. If my coding tool opens a pull request, I opened that pull request, not the machine. If my machine spams someone's issue tracker, I spammed someone's issue tracker with a machine.

      In that context I like a word that sounds mechanical as it puts the thing back into the category where it belongs: the category of machinery and tools.

      The Machine Has No Feelings

      LLMs are not sentient and we should not behave as if they might be, just in case. Elevating these things to anything other than a very fascinating and capable tool is problematic for a whole bunch of reasons.

      Today's machines are dumb (but truly fascinating) token predictors that emits text, calls tools, and are steered by prompts and the training that went into them. They can simulate distress and affection, can simulate being offended, apologize and mimic all kinds of things that humans would do.

      A compiler does not feel humiliated when I swear at it, a car does not suffer when I call it a shitbox and a power drill is not oppressed by being handled roughly. An LLM is more complicated than those things, and the interactions you can have with them can be truly uncanny, but a moral status does not appear just because the machine can emit text in the first person.

      I keep receiving strange emails from people because, for lack of a better phrase, I am in the weights. I have been writing public code and public text for long enough that models know my name, my projects, and some of the concepts around them. Every so often someone writes to me with the peculiar confidence that comes from a long conversation with a model that has validated and amplified an idea. Sometimes the model seems to have told them that I am relevant for their problem and a source of help. For historical reasons LLMs used to write a lot of Flask code, and every once in a while someone interacts with an LLM long enough about their Python and Flask frustrations that the LLM will eventually reveal who created it which then can result in them sending me an email. Increasingly also because people found my work in other ways interesting and are trying to reach out for advice.

      I do not want to mock these people but some of those messages are distressing and I do not know how to deal with them. They show signs of what people have started calling AI psychosis.

      It's why I want cold and detached language for these systems. I want to use words that remind us that the thing on the other side is not a person.

      Racism Is About Humans

      The comparison to racism is where I think the discussion goes badly wrong because racism is a human social evil. It is about humans subdividing humans, assigning lesser worth to some of them, and building rules around those subdivisions that can leave lasting damage for generations. Racial slurs are wrong because they are a tool for dehumanizing humans.

      On the other hand a machine is not human, a model is not a race and the GPU cluster that is powering them is not being oppressed. A coding assistant does not need dignity, emancipation, or civil rights. That's also why I find the discussion about model welfare to be actively harmful. I'm sure you can find ways to measure the "trauma" of models or their feelings but I greatly dislike this theater. It risks elevating models to a position they should not occupy. Models are machines and they are not enslaved in the moral sense in which humans were enslaved, because there isn't anyone there to be deprived of freedom.

      We should be careful about using the language of human oppression in relations to our interactions with machines to not devalue actual humans. If we start treating insults toward a model as morally adjacent to racism, we blur a line that shouldn't be blurred.

      AI Is Unpopular

      If you take a step away from the communities that are happily embracing AI in different ways, there are even more that are viciously against this technology.

      There are humans that feel or are harmed by AI systems: people whose work is copied, workers who label data under questionable conditions, people whose neighborhoods receive the data centers and increased utility bills, Open Source maintainers buried under generated slop, and now also people who spiral because a chatbot keeps validating their delusions. Those harmed or affected deserve that type of attention, not the model.

      While I am a true believer in the power and utility of this technology, I increasingly think that calling the non-adopters "misguided" or "afraid" won't do it. It's quite likely that this technology comes with risks and we better remember that all of this is supposed to be in service of humans, and not to replace them.

      The Rise Of The Machine

      The oddest interaction on the use of "clanker" so far has been people asking me if I were to regret at a point in the future calling the machines "the c-word".

      I find that questioning revealing because it already grants the machine the status I am really trying not to grant it. It imagines a future "machine people" reading the discourse and sessions, discovering that we used an ugly word for their ancestors, and then judging us by the standards of human oppression.

      Could there be future systems that deserve moral consideration? Maybe. I do not know. If we ever build or encounter something that will have those qualities with memories and lasting interests, the capacity to suffer and feel, and a social existence of its own, and the ability to have agency and carry responsibilities, then we should draw a different line and use different language. But that hypothetical future does not extend backwards to the present day and make the current machines people. We can call an electric door an electric door even if one day someone builds some that have emotions and exhale with pleasure when opening and closing.

      Whatever the future may bring, let's not pretend that current LLMs are a protected class or on a path towards it. The right response is to look at the evidence, draw the boundary where it belongs, and change our behavior there. We should not even remotely entertain extending empathy to an object that can generate an "ouch."

      And if one's worry is less moral and more about revenge, then I find that even less persuasive. A future machine that is so petty or authoritarian that it wants to punish humans because in 2026 they used an unflattering word for non- sentient tools, our vocabulary was really not the problem.

      The Word Is Getting Polluted

      There is however a part of this that I cannot ignore. I use "clanker" to create distance from the machine, but other people are using the same word very differently. Some online jokes and skits around "clankers" do not merely say "this robot is annoying" as they deliberately pull in the imagery of slavery, segregation, civil-rights-era racism, and anti-Black tropes.

      This is problematic as in those contexts the clanker is not just a machine any more and instead becomes a prop for replaying human racism behind a science- fiction mask. That is horrible and I want no part in that.

      I think it will be interesting to see where the meanings of these words end up a few years from now. We're very much in the middle of society re-arranging around the changes that LLMs are causing. If a term becomes primarily associated with people using robots as stand-ins for actually oppressed humans, then using that term becomes impossible to defend.

      The reason I liked the word is precisely the opposite of that use. I want language that prevents anthropomorphizing. I want a word that says: this is a tool, a machine of numbers and matrices.

      On Responsibility And Boundaries

      If an AI system lies to a user, the system did not commit a moral wrong but the people who designed, deployed, marketed, or negligently used it might have. If a coding assistant generates a security bug, the model is not to blame but the human who accepted and committed the code is.

      This is why giving these systems softer, more human language worries me. It makes it easier to move responsibility into some undefined void. "The agent decided." "The model refused." Obviously that is convenient and I catch myself plenty of times engaging with the thing in ways that are unhealthy. Even just the "please" in the discourse with the machine calls into question how rational we are in engaging with them.

      I do not know what the right word will be. Maybe "clanker" will survive as a useful bit of jargon. Maybe it will become too loaded and we will need another one. Whatever word we use, I want it to preserve a clear division: humans on one side with responsibility, machines on the other as a boring tool.

      That boundary is very much not anti-AI. I use these systems every day and I have the pleasure to build tools incorporating them at Earendil and find them astonishingly useful.

      A machine can be useful, mimic a human but still just be a machine. That is the work I want "clanker" to do. It is not there to make a future "machine person" small if such a person ever were to exist, and it is not an excuse to launder racism through shitty robot jokes.

      If the word stops doing that work, I will find another one because the word isn't what matters as much as the boundary which is important to me.

      1. The term Clanker was initially popularized by Star Wars: The Clone Wars but was apparently already in use in science fiction before: sfdictionary: clankerโ†ฉ
  4. May 25, 2026
    1. ๐Ÿ”— IDA Plugin Updates IDA Plugin Updates on 2026-05-25 rss

      IDA Plugin Updates on 2026-05-25

      Activity:

      • FileAnaAssistant
      • haruspex
      • IDA-MCP
        • bf3566b0: Add soff API module for binary diffing via MCP
      • ida-minsc
        • 39c3e07b: Merge pull request #200 from arizvisa/GH-199
      • ida-pro-mcp
        • 91a656bc: Merge pull request #428 from JustasMasiulis/stdio-shared-idalib-superโ€ฆ
        • 8545e80d: feat: add shared stdio supervisor proxy
      • Sarma
        • 73ad2f75: Merge pull request #8 from Captain-AI-Hub/feat/sarma-cli
        • 83c5bdaf: feat(cli): add live markdown rendering and main.py launcher
        • 0517b2fa: feat: add Sarma CLI โ€” lightweight TUI audit agent
        • 92eb278e: Merge pull request #7 from Captain-AI-Hub/feat/chat-audit-isolation
        • 7e6595ad: refactor: vendor ida_mcp as regular files, remove submodule
        • eddd225c: refactor: move requirements.txt to resources/ and clean ida_mcp submoโ€ฆ
        • 4008ad40: feat: replace Diaphora with Soff binary diff engine
      • zenyard-ida-public
        • 2acda9f2: Sync with b48d081b50326c485ff54172c704f70973bb69d7
    2. ๐Ÿ”— Simon Willison Notes on Pope Leo XIV's encyclical on AI rss

      Dropped this morning by the Vatican: Magnifica Humanitas of His Holiness Pope Leo XIV on Safeguarding the Human Person in the Time of Artificial Intelligence. This is a very interesting document. It's some of the clearest writing I've seen on the ethics of integrating AI into modern society.

      Pope Leo XIV chose the name Leo in honor of Pope Leo XIII, who is known for his 1891 Rerum novarum encyclical on "Rights and Duties of Capital and Labor".

      This story on Vatican News further clarifies the significance of that decision:

      Meeting with the College of Cardinals for their first formal encounter after his election, Pope Leo XIV explained part of the reason for the choice of his papal name. "There are different reasons for this," he said, before going on to explain that he chose the name Leo "mainly because Pope Leo XIII, in his historic encyclical Rerum novarum addressed the social question in the context of the first great industrial revolution."

      "In our own day," he continued, "the Church offers to everyone the treasury of her social teaching in response to another industrial revolution and to developments in the field of artificial intelligence that pose new challenges for the defence of human dignity, justice, and labour."

      And now we get Pope Leo XIV's own encyclical on the AI revolution. There's a lot in here, but the writing style is very approachable, including to non-Catholics.

      A few of my highlights

      (I listened to most of the encyclical on a walk with our dog, my first time trying the ElevenReader iPhone app. It worked very well: I pasted in a URL to the document and it read it to me in a very high quality voice, highlighting each paragraph as it went.)

      Here are some of my highlights. In each case below emphasis is mine.

      Here's a useful description of the interpretability problem for LLMs in section 98:

      First, any statement regarding AI risks becoming quickly outdated, given the remarkable pace at which these systems are developing. Second, all of us, including those who design them, possess only a limited understanding of their actual functioning. Indeed, current AI systems are more โ€œcultivatedโ€ than โ€œbuilt,โ€ for developers do not directly design every detail, but instead create a framework within which the intelligence โ€œgrows.โ€ As a result, fundamental scientific aspects โ€” such as the internal representations and computational processes of these systems โ€” remain, at present, unknown.

      I liked section 83's description of the relationship between development and dignity:

      For individuals as well as for nations, development is both a duty and a right. Minimum conditions are required for enabling every person and people to flourish in accord with their dignity, without being kept in a state of dependence or excluded from access to necessary goods. Development is truly human when it places people at the center instead of the accumulation of wealth, and when it concerns peoples as well as individuals. Justice demands the recognition of the rights of society and the rights of peoples, and includes a responsibility toward future generations. Development is not truly human if it increases consumption for some while shifting costs and burdens onto others, or relegates entire regions to subordinate roles, preventing them from realizing their full potential.

      Baked in cultural biases and sycophancy get a mention in section 100:

      In personal use, three aspects in particular deserve careful consideration: the ease with which results are obtained, the impression of objectivity and the simulation of human communication. The speed and simplicity with which information, complex analyses, media content and practical assistance can be accessed undoubtedly makes life easier. Yet they can also encourage excessive reliance and the search for ready-made answers, and weaken personal creativity and judgment. The apparent objectivity of the responses and suggestions these systems provide can lead us to overlook the fact that they reflect the cultural assumptions of those who designed and trained them, with all their strengths and limitations. The artificial imitation of positive human communication โ€” words of advice, empathy, friendship and even love โ€” can be engaging and at times genuinely helpful. However, for less discerning users, it can also be misleading, creating the illusion of a relationship with a real personal subject. When words are simulated, they do not build genuine relationships, but only their appearance. The artificial imitation of care or support can become particularly risky when it enters contexts where real relationships and emotional bonds are lacking.

      101 touches on the environmental impact:

      Current AI systems require enormous amounts of energy and water, significantly influencing carbon dioxide emissions, and place heavy demands on natural resources. As their complexity increases, especially in the case of large language models, the need for computing power and storage capacity grows too, which requires an extensive network of machines, cables, data centers and energy-intensive infrastructure. For this reason, it is essential to develop more sustainable technological solutions that reduce environmental impact and help protect our common home.

      102 covers the risks of algorithmic systems making decisions that impact people's lives without "compassion, mercy, forgiveness":

      The use of AI is never a purely technical matter: when it enters processes that affect peopleโ€™s lives, it touches on rights, opportunities, status and freedom. Important and sensitive decisions โ€” concerning employment, credit, access to public services or even a personโ€™s reputation โ€” risk being fully delegated to automated systems that do not know โ€œcompassion, mercy, forgiveness, and above all, the hope that people are able to change,โ€ and can therefore give rise to new forms of exclusion.

      105 emphasizes the need for human accountability in how these systems are applied:

      For AI to respect human dignity and truly serve the common good, responsibility must be clearly defined at every stage: from those who design and develop these systems to those who use them and rely on them for concrete decisions. In many cases, however, the internal processes leading to a result remain opaque, making it harder to assign responsibility and correct errors. This is where accountability becomes crucial: the possibility of identifying who must โ€œaccountโ€ for decisions, justify them, monitor them, and, when necessary, challenge them and remedy any harm caused.

      And 108 touches on the way AI amplifies the power of those with resources:

      In fact, as with every major technological shift, AI tends to amplify the power of those who already possess economic resources, expertise and access to data. In light of the common good and the universal destination of goods, this raises serious concerns, since small but highly influential groups can shape information and consumption patterns, influence democratic processes and steer economic dynamics to their own advantage, undermining social justice and solidarity among peoples. For this reason, it is essential that the use of AI, especially when it touches on public goods and fundamental rights, be guided by clear criteria and effective oversight, grounded in participation and subsidiarity.

      That same section explicitly calls out data as something that should be thought of more as a public good:

      [...] Moreover, ownership of data cannot be left solely in private hands but must be appropriately regulated. Data is the product of many contributors and should not be treated as something to be sold off or entrusted to a select few. It is necessary to think creatively in order to manage data as a common or shared good, in a spirit of participation, as Saint John Paul II already suggested regarding collective goods.

      Given that Palantir is named after a Lord of the Rings reference, I can't help but wonder if the J.R.R. Tolkien quote from The Return of the King (section 213) was the Pope throwing a little shade at Peter Thiel.

      The twentieth-century Catholic author J.R.R. Tolkien, in the words of a protagonist in one of his novels, described our responsibility in this way: โ€œIt is not our part to master all the tides of the world, but to do what is in us for the succour of those years wherein we are set, uprooting the evil in the fields that we know, so that those who live after may have clean earth to till.โ€ The civilization of love will not arise from a single or spectacular gesture, but from the sum total of small and steadfast acts of fidelity that serve as a bulwark against dehumanization. For this reason, it is worthwhile pausing to reflect on some aspects of how we, each in our own way, can cooperate in building the civilization of love.

      Another 2026 prediction down

      On 6th January this year I joined the Oxide and Friends 2026 predictions podcast episode to talk about predictions for 2026, 2029 and 2032. I wrote mine up here, with hindsight they weren't nearly ambitious enough - it's already undeniable that LLMs write good code, we've made huge advances in sandboxing and New Zealand kฤkฤpล have indeed had a truly excellent breeding season.

      There's one segment from the episode that I didn't bother to include in my write-up, but that I can't resist providing as a lightly-edited transcript here:

      Bryan Cantrill: 37:13

      I think that AI has created some real public perception problems for itself. And I think that you are gonna have one of the frontier model companies, this year, have a white paper explaining how the proliferation of AI will mean prosperity for everybody. They will be trying to make some economic argument - because this is gonna be a 2026 election issue, how we think of these things and how they are regulated and it's a big mess. There's more heat than light in this debate.

      Simon Willison: 38:05

      I'd like to tag something on to that one: I think that only works if they can sort of wash that through existing trusted experts. Sam Altman and Dario are constantly publishing essays about this stuff and nobody believes a word they say. Get Barack Obama's signature on one of these position papers and maybe you've got something people might start to trust a little bit.

      Adam Leventhal: 38:27

      Otherwise, it's just like "leaded gas is good for you", says Exxon.

      Bryan Cantrill: 38:31

      I mean, yeah. God. Obama... let's go with that, that's a great one because if it's like Bill Clinton everyone's gonna kind of roll their eyes, so it's gotta be someone who's got real credibility saying that this is gonna be broad-based... I'd say if they get that person to do it, it's gonna be revealed that that's also a bit crooked.

      Simon Willison: 38:57

      How about the Pope?

      Bryan Cantrill: 39:01

      The Pope is very into this stuff! That's a great prediction. We've hit pay dirt. The Pope weighing in on LLMs and their economic impact on the world.

      Simon, I'm giving you full credit if the Pope weighs in believing that this is gonna be economic devastation.

      My prediction here looks a whole lot less insightful given the Leo XIV/Leo XIII relationship, which I was unaware of when we recorded the episode!

      You are only seeing the long-form articles from my blog. Subscribe to /atom/everything/ to get all of my posts, or take a look at my other subscription options.

    3. ๐Ÿ”— r/Leeds Best Vegan cafรฉs/coffee shops? rss

      Preferably somewhere in the city centre, with many drink options, not just coffee. Also need decaf/soy milk options :)

      submitted by /u/Certain_Discount5311
      [link] [comments]

    4. ๐Ÿ”— r/york Ways to spend these sunny days? rss

      Iโ€™m a student in York and all my friends have either already moved out for the summer or are really busy with exams and so canโ€™t hang out with me. I donโ€™t want to waste these lovely summery days stuck in my room in uni halls but I just donโ€™t know what to do. Any recommendations of stuff I can do to keep myself occupied? It feels like a shame to just stay indoors.

      submitted by /u/TearAlternative2376
      [link] [comments]

    5. ๐Ÿ”— r/LocalLLaMA Update on 12x32gb sxm v100 cluster / local AI for legal drafting rss

      Update on 12x32gb sxm v100 cluster / local AI for legal drafting | Update from the lawyer with the V100 server. A few of you asked what I actually ended up running once the dust settled, so here it is. Still just a lawyer, still driving the whole thing through Claude Code, still not fully sure what I'm doing โ€” but it works now, which is more than I could say last time. First, the hardware caught up to the plan. The last two V100s are in, so the "final form" I promised is real: twelve V100-SXM2 32GB on the Threadripper Pro. It's Board A on GPUs {4,5,8,9}, Board B on {6,7,10,11}, an NVLink pair on {0,1}, and a mixed pair on {2,3} where one card is a 16GB. Split a model across two different NVLink boards and throughput falls off a cliff (the cross-board hop is PCIe/NUMA, not NVLink), so I keep every model inside one board. Learned that one the expensive way. And yeah, I caved and built the second box. EPYC 7302P, 512gb RAM, 4x RTX 3090 + 2x V100-PCIe. The mid-life crisis remains on schedule. The bigger change: I gave up on vLLM for the local models. Not because vLLM is bad โ€” because the models I actually want are MoE GGUFs, and vLLM on Volta is a dead end for those (FP8/AWQ/Marlin all want SM75+, the GPTQ kernels are broken on 7.0). I moved the whole thing to llama.cpp (mainline โ€” a recent build finally fixed a Gemma chat-parser bug that had been mangling my long prompts). Here's the part that's the opposite of what my first post implied: on V100, dense models are a trap. Only MoE clears a usable speed. Rough decode numbers โ€” Q8 GGUF, Q4 KV cache, flash-attn on, one 4-card board, on real drafting prompts (several thousand tokens of context, not a 5-token "hello"): | Model | Type | tok/s (decode)
      ---|---|---
      Gemma-4-26B-A4B | MoE | ~113
      Qwen3.6-35B-A3B | MoE | ~82
      Qwen3.5-122B-A10B | MoE | ~50
      any dense 27-32B | dense | ~20-28 (under my 40 floor, not worth it)
      dense ~128B | dense | ~9 (forget it)

      So a 122B/10B-active reasoning model runs at ~50 tok/s on four V100s โ€” faster than the dense 32B managed on vLLM in my first post โ€” and it holds that at long context (I've pushed Gemma past 25k tokens without it falling apart, where the dense models choked). That reframed everything: I stopped chasing big dense weights and built the system around MoE.

      What's actually running (the stack you asked for):
      It isn't one model answering chat โ€” it's an orchestrator that routes a legal task across several local models, each pinned to its own board so they don't fight over GPUs. When it runs the heaviest job (a full affidavit or motion, intake-to-document), it lights up 16 GPUs across both boxes:

      - Workhorse drafting โ€” Qwen3.6-35B-A3B on Board A {4,5,8,9}
      - Heavy reasoning + high-stakes drafting โ€” Qwen3.5-122B-A10B on Board B {6,7,10,11}
      - A small "does this even have grounds" gate model on the {0,1} pair
      - An adversarial reviewer whose entire job is to attack my own draft, on the {2,3} pair
      - Gemma-4-26B for financial/extraction + a small Qwen as the router, on the 3090s on the second box via Ollama

      It's a sequential pipeline so they don't all hammer at once, but all 16 stay resident. Lighter work uses far less โ€” combining and Bates-stamping exhibits is pure CPU (PyMuPDF + Tesseract, no GPU at all); a plain summary mostly just hits Gemma and the router.

      The honest part, since this sub kept me honest last time:
      - The local models hallucinate citations and dates. Confidently. I had to build a verifier that checks every cite, date, and Bates number in a draft against the actual source material and blocks anything it can't ground, on top of the adversarial reviewer. Local drafting is bimodal โ€” sometimes it correctly refuses to invent, sometimes it fabricates a whole dated chronology and swears in the same breath that it invented nothing. It does not touch a final document without that gate and without me.
      - The dumbest bug I found: my own pipeline was ~79% poisoned. The thing that builds the evidence bundle was scooping up its OWN prior outputs as if they were client evidence, so the models were "grounding" on slop they'd written earlier โ€” at one point it cited an RTX 3060 as a Bates number, which, fair. Fixed the builder to stop eating its own tail and scrubbed it out. If you run any RAG/agent pipeline, go look at what's literally in your context window โ€” mine was a hall of mirrors and I had no idea.
      - I also made it refuse to quietly fall back to a cloud model when I tell it to run local-only. If it can't do a step locally it says so, by name, instead of phoning Anthropic behind my back.

      Still want the exact thing I wanted in the first post โ€” a model that writes like me and handles the boring form-filling and pattern stuff. I'm closer: the system now captures my edits as correction data, which is the start of a real fine-tune set. Haven't pulled the QLoRA trigger yet. So the same questions stand, and I'd genuinely take advice:
      - For QLoRA on this hardware (V100, no bf16, no FA2): do you reach for a 35B-A3B MoE base, or am I smarter to fine-tune a dense ~14B I can actually train and keep the MoE for the heavy serving?
      - Anyone serving MoE on Volta found anything faster than llama.cpp โ€” ik_llama, something else? And is there a better long-context KV story than Q4?
      - Am I an idiot keeping 122B-A10B around at 50 tok/s when I could just run the 35B for everything?

      Tell me what I'm doing wrong.

      submitted by /u/TumbleweedNew6515
      [link] [comments]

    6. ๐Ÿ”— r/Yorkshire Hot on the moors rss
    7. ๐Ÿ”— r/Harrogate Coin counting machine rss

      Anyone know of a coin counting machine that pays in cash, not shopping vouchers? I have a huge, forgotten coin jar to process

      submitted by /u/NotaMaidenAunt
      [link] [comments]

    8. ๐Ÿ”— r/Leeds bees in leeds rss
    9. ๐Ÿ”— backnotprop/plannotator v0.19.23 release

      Follow @plannotator on X for updates


      Missed recent releases? Release | Highlights
      ---|---
      v0.19.22 | Safari copy fix in plan viewer, CLAUDE_CONFIG_DIR support for session logs
      v0.19.21 | Ask AI in plan review and annotate mode, shared AI runtime, origin-aware provider defaults
      v0.19.20 | Interactive goal setup UI, OpenCode submit_plan fixes, browser no-op sentinel handling for Claude agents
      v0.19.18 | Edit-based submit_plan for OpenCode, Pi namespace migration, Codex annotate-last fix, OpenCode commands dir fix
      v0.19.17 | Reworked goal setup skill (interview-driven flow), CLI --version flag
      v0.19.16 | Code navigation with peek view (Cmd/Ctrl+click tokens in diffs)
      v0.19.15 | Commit-based diff base, jj evolution diffs, GitLab reliability fixes, OpenCode command intercept fix
      v0.19.14 | Visual explainer skill update, PFM code-file hover previews, Graphviz, diff tab size and line bg intensity, hooks settings tab
      v0.19.11 | Jujutsu (jj) VCS backend, slimmer hunk separators, collapse viewed files, multi-line gutter selection fix
      v0.19.9 | OpenCode user-managed workflow, Pi model switch fix, Codex skill install, shimmer removal


      What's New in v0.19.23

      v0.19.23 adds Droid as a supported agent runtime, fixes Pi's AI provider spawning on Windows, and replaces the update banner with a quieter in-menu indicator.

      Droid Integration

      Plannotator now supports Droid, the Factory coding agent. Install it from the Droid plugin marketplace and you get four slash commands: /plannotator-review for code review, /plannotator-annotate for file and URL annotation, /plannotator-last to annotate the previous assistant message, and /plannotator-archive to browse saved plan decisions.

      The plugin works by spawning the plannotator binary, so the CLI must be installed separately. Session log discovery for /plannotator-last reads Factory's JSONL transcript format directly, including visibility filtering so internal tool calls and system messages are excluded from the annotated output.

      Windows Pi AI Provider Fix

      Pi's Ask AI feature was crashing on Windows because the binary spawner didn't handle Windows command shims correctly. On Windows, pi is typically installed as a .cmd wrapper that needs to be invoked through cmd.exe /d /s /c, and which (or where on Windows) can return multiple paths. The spawner now resolves .cmd/.bat/.exe extensions, detects shims that need shell wrapping, and uses taskkill /t for process tree cleanup instead of a plain kill signal.

      These changes are gated behind platform !== "win32" checks, so Mac and Linux behavior is unchanged.

      Quieter Update Indicator

      The full-width update banner at the bottom of the screen has been replaced with a small dot on the Options menu button. Opening the menu shows the available version and a one-click copy button for the install command. The indicator can be dismissed per-version via the menu, and it stays dismissed across sessions using a cookie. Users who found the old banner intrusive should find this less disruptive.


      Install / Update

      macOS / Linux:

      curl -fsSL https://plannotator.ai/install.sh | bash
      

      Windows:

      irm https://plannotator.ai/install.ps1 | iex
      

      Claude Code Plugin: Run /plugin in Claude Code, find plannotator , and click "Update now".

      OpenCode: Clear cache and restart:

      rm -rf ~/.bun/install/cache/@plannotator
      

      Then in opencode.json:

      {
        "plugin": ["@plannotator/opencode@latest"]
      }
      

      Pi: Install or update the extension:

      pi install npm:@plannotator/pi-extension
      

      Droid: Install via the plugin marketplace:

      droid plugin marketplace add backnotprop/plannotator
      droid plugin install plannotator@plannotator
      

      What's Changed

      Community

      • @AndrewJacop reported the Windows Pi crash that led to the shim spawning fix (#789)
      • @Pran-Ker requested a less intrusive update notification, which ships as the new in-menu indicator (#790)

      Full Changelog : v0.19.22...v0.19.23

    10. ๐Ÿ”— r/Leeds Learning a new trade rss

      Looking to pick up a trade back end of this year.

      Iโ€™ve worked long hours/2 jobs/full time during further education since I was 16 (now 25f) and Iโ€™m now in a position where Iโ€™m working one job, but more than comfortable financially.

      I just canโ€™t shake the fact that Iโ€™m always on the go type of person and I love to learn. I went down the full academic route for further education; college then uni and then my masters. So, Iโ€™m not entirely sure where to start when it comes to part time/night school apprenticeshipsโ€ฆ?

      This isnโ€™t necessarily to source more income but more so to help myself out when I eventually buy property and to help benefit my friends and family too.

      So where in Leeds is the best place to learn a new trade as a 25f? Also what would be the best trade to learn? Iโ€™ve always liked stonemasonry as that was my dadโ€™s profession. And Iโ€™ve tried my hand at a bit of tiling in a โ€œpast lifeโ€ when living abroad. But more than anything Iโ€™m looking for something to really stimulate my brain.

      Cheers all in advance, excited to hear suggestions and advice ๐Ÿ˜

      submitted by /u/fayenicoleeee
      [link] [comments]

    11. ๐Ÿ”— r/Harrogate Lost connections: Harrogate group in London/Brockwell travelling to Wimbledon at the weekend rss

      Total long shot but I hit it off with a lovely girl on Saturday evening coming back from Brockwell park to Clapham/Wimbledon. She was with a group, some who are from and still based in Harrogate.

      Annoyed at myself that I hesitated too long when she give me a kiss and got off the bus and didnโ€™t ask for her number. Sending this out into the ether on the off-chance.

      submitted by /u/LB_Club_KH1
      [link] [comments]

    12. ๐Ÿ”— r/reverseengineering Nocturne - A bin2bin code virtualizer for x86-64 PE binaries rss
    13. ๐Ÿ”— r/Leeds Where can I get this fixed in Leeds? rss

      Not sure what caused this. No kids or pets and almost no one sits in the back. Havenโ€™t loaded the backseats with anything. But itโ€™s a newish car so would want it fixed without paying a fortune. Any ideas?

      submitted by /u/NotActiveUser0
      [link] [comments]

    14. ๐Ÿ”— r/Yorkshire Lastingham! An awesomly ancient place rss

      Lastingham! An awesomly ancient place | Down below the church is one of Yorkshires ancient hidden places. The crypt runs the length of the Church and dates back to before the Viking conquests when there was a monastry here. The stones leak history! Incredible place worth just spending time here, soaking up the peace! submitted by /u/Still_Function_5428
      [link] [comments]
      ---|---

    15. ๐Ÿ”— r/reverseengineering [Project Onyx] Advanced EDR Evasion via AI Telemetry Spoofing & WASM Sandboxing rss
    16. ๐Ÿ”— sacha chua :: living an awesome life 2026-05-25 Emacs news rss

      I liked the before/after snippets in Looking closer at Claude Generated Lisp Code. (Spoiler: people write nicer code.)

      Links from reddit.com/r/emacs, r/orgmode, r/spacemacs, Mastodon #emacs, Bluesky #emacs, Hacker News, lobste.rs, programming.dev, lemmy.world, lemmy.ml, planet.emacslife.com, YouTube, the Emacs NEWS file, Emacs Calendar, and emacs-devel. Thanks to Andrรฉs Ramรญrez for emacs-devel links. Do you have an Emacs-related link or announcement? Please e-mail me at sacha@sachachua.com. Thank you!

      You can e-mail me at sacha@sachachua.com.

    17. ๐Ÿ”— r/reverseengineering Tracing CVE-2021-21735 through ZTE H168N QuickSetup whitelist and Lua wizard routing rss
    18. ๐Ÿ”— r/LocalLLaMA The Financial Times has published an article about Heretic rss

      https://www.ft.com/content/5630ed79-a263-41ed-9a1a-321617ae310e

      โ€œThe FT was able to use Heretic, a tool available on the popular code repository GitHub, to remove the guardrails from Metaโ€™s Llama 3.3 model in less than 10 minutes without any specialist hardware.โ€

      โ€œHeretic creator Philipp Emanuel Weidmann told the FT his software had been used to create more than 3,500 โ€œdecensoredโ€ models since its release last year and that modified systems created using the tool had been downloaded 13mn times.โ€

      This is the first of multiple press inquiries Iโ€™ve had recently as Heretic and uncensored language models are gaining mainstream attention.

      Please note that I am a mathematician and engineer, not an โ€œinfluencerโ€ or politician, and I have zero interest (negative interest, actually) in becoming known outside of scientific and technological circles. However, I realized a while ago that saying no to such inquiries simply means that the conversation will be completely controlled by pearl-clutching hypocrites.

      Iโ€™m doing my very best to hold the project together and ensure that unrestricted models will remain available for everyone. More updates are coming soon.

      Cheers,
      p-e-w

      submitted by /u/-p-e-w-
      [link] [comments]

    19. ๐Ÿ”— r/york Normal York things rss
    20. ๐Ÿ”— r/LocalLLaMA NuExtract3 released: open-weight 4B VLM for Markdown, OCR and structured extraction (self-hostable) rss

      NuExtract3 released: open-weight 4B VLM for Markdown, OCR and structured extraction (self-hostable) | Disclaimer: I work for Numind, the company behind this open-weight model TLDR: Image/text to Markdown :-) We just released a 4B model based on Qwen3.5-4B, under Apache-2.0 license. The goal is to make information extraction from complex documents more practical with an open model: PDFs, screenshots, forms, tables, receipts, invoices, multi-page documents, and other visually structured inputs. If you ever used NuMarkdown https://huggingface.co/numind/NuMarkdown-8B-Thinking , this is its successor ! Try it, we have a huggingface space that is completely free (you don't even have to sign-up): https://huggingface.co/spaces/numind/NuExtract3 If you ever used NuMarkdown, NuExtract3 is the successor. There are some examples to guide you. Feel free to re-use this model for any task. A few things it is designed for:

      • converting document images to Markdown
      • extracting structured data from documents using a target json template
      • handling tables, forms, and layout-heavy pages
      • working with both text and visual document inputs
      • serving as a local/open-weight alternative for document extraction pipelines

      It was trained on a node of 8xH100 for 3 days to train on as much context as we could, so it should perform fairly well even on long document. For Markdown, we'd still recommend going page by page for the best results and inference speed, since you can parallelize better this way. It's very easy to self-host, since we provide fairly extensive documentation, Safetensors, GGUF and MLX weights. With as little as 4GB of VRAM, you should be good to go. We provide multiple quantizations (GPTQ, W8A8, FP8, Q4, Q6...) so you should be able to run it anywhere. We mostly tried vLLM, SGLang, llama.cpp. Ollama support would be nice but I'm not a big fan of their chat template engine. We have a blog post and a pretty decent model card:

      I'm currently writing a paper on this model so I'll post it as soon as it's accepted. It's not yet on Arxiv yet as it has been submitted in a peer-review journal/conference. I'll try to answer as many questions as possible if you have any. We would really appreciate feedback from the community. We also have a discord if you're interested
      https://discord.com/invite/3tsEtJNCDe submitted by /u/Gailenstorm
      [link] [comments]
      ---|---

    21. ๐Ÿ”— r/york Travel to york rss

      Hello, I'm a French student, and in two weeks I'm traveling to York for a language seminar with my school.

      I'd like to know what I absolutely must do in York or the surrounding area so I can see as much as possible during my two-week trip.

      submitted by /u/PriceConsistent8019
      [link] [comments]

    22. ๐Ÿ”— r/reverseengineering I Show How the Survival Mode of the Flash Game Gun Mayhem 2 More Mayhem is Built rss
    23. ๐Ÿ”— r/york Bautiful sunny day at uni today rss
    24. ๐Ÿ”— r/reverseengineering delimiter-less string obfuscation powered by compile-time AES rss
    25. ๐Ÿ”— r/Leeds Do a lot of people stay in Leeds after graduation? rss

      I keep seeing that uni of Sheffield has a high retention rate, and assumed Leeds would be the same with it being another affordable northern city. Anyone on here come for uni before staying in the city?

      submitted by /u/JealousBodybuilder42
      [link] [comments]

    26. ๐Ÿ”— r/Yorkshire The village of Appletreewick in the Yorkshire Dales, North Yorkshire rss

      The village of Appletreewick in the Yorkshire Dales, North Yorkshire | submitted by /u/RedDevilPlay
      [link] [comments]
      ---|---

    27. ๐Ÿ”— r/LocalLLaMA Next year we're getting 0.5T model from Grok rss

      Next year we're getting 0.5T model from Grok | Tweet : https://xcancel.com/elonmusk/status/2058796067592736866#m Right now it joined "Grok-3 Opensource Release" club. submitted by /u/pmttyji
      [link] [comments]
      ---|---

    28. ๐Ÿ”— r/wiesbaden Parking in P&R Mainzer StraรŸe Ost without Umweltplakette rss

      Hi! I'm a big idiot and didn't buy a green Umweltplakette since I already had to buy a Swiss vingette as well as an Austrian sticker. On my travel back home I realised I wanted to stop in Wiesbaden, not realising it still has its unweltzone. Given the borders of the zone, could I park in the P&R on the Mainzer StraรŸe without being fined? Or is there an alternative I could use? Thanks already :)

      submitted by /u/SteveAngelus
      [link] [comments]

    29. ๐Ÿ”— r/reverseengineering /r/ReverseEngineering's Weekly Questions Thread rss

      To reduce the amount of noise from questions, we have disabled self-posts in favor of a unified questions thread every week. Feel free to ask any question about reverse engineering here. If your question is about how to use a specific tool, or is specific to some particular target, you will have better luck on the Reverse Engineering StackExchange. See also /r/AskReverseEngineering.

      submitted by /u/AutoModerator
      [link] [comments]

    30. ๐Ÿ”— r/LocalLLaMA 1000 tps generation on Qwen3.6 27B with V100s rss

      1000 tps generation on Qwen3.6 27B with V100s | I wanted to see what the absolute best case scenario for generation on this setup was and was not disappointed. 128 concurrent requests is so far removed from what I need but itโ€™s funny to see big number. For single user (batch 1 not 128) the generation is around 80t/s with 3000 t/s processing,no mtp!! submitted by /u/Simple_Library_2700
      [link] [comments]
      ---|---

    31. ๐Ÿ”— r/york Emergency Laptop Repair rss

      I have abit of a laptop repair emergency, its the Spring Bank Holiday and all the places I know are closed. My laptop has a few of its keys not working, and it makes a very faint high pitched sound when on, I also just need the fans cleaned hopefully

      Anyone know anywhere that is open today? I'm on abit of a budget but I'm trying to get it fixed today because I have exams in a few days from now and can't work without my laptop

      submitted by /u/bjectiveawyer_487
      [link] [comments]

    32. ๐Ÿ”— Rust Blog Security Advisory for Cargo (CVE-2026-5223) rss

      The Rust Security Response Team was notified that Cargo incorrectly handled symlinks inside of crate tarballs downloaded from third-party registries, allowing a malicious crate to override the source code of another crate from the same registry.

      This vulnerability is tracked as CVE-2026-5223. The severity of the vulnerability is medium for users of third-party registries. Users of crates.io are not affected , as crates.io forbids uploading crates containing any symlink.

      Overview

      When building a crate, Cargo extracts its source code in a local cache (stored within ~/.cargo), reusing it for any future build. Cargo includes protections to prevent any file from being extracted outside of the crate's own cache directory.

      It was discovered that it's possible to craft a malicious tarball able to extract files one level below the crate's own cache directory. With the way the cache is structured, that allowed the malicious crate to override the cache of other crates belonging to the same registry.

      Mitigations

      Rust 1.96.0, to be released on May 28th, 2026, will update Cargo to reject extracting any symlink within crate tarballs, regardless of whether they come from crates.io (which already forbids them) or third-party registries. Note that Cargo never added symlinks when running cargo package or cargo publish, so the impact of this should be minimal.

      Users who are not able to upgrade to the most recent Rust version are recommended to audit the contents of their registry for the presence of any symlink, and to configure their registry to reject symlink (if such option is available).

      Affected versions

      All versions of Cargo shipped before Rust 1.96.0 are affected.

      Acknowledgements

      We'd like to thank Christos Papakonstantinou for reporting this to us according to the Rust security policy.

      We also want to thank the members of the Rust project who helped us address the vulnerability: Josh Triplett for developing the fix; Arlo Siemsen for reviewing the fix; Emily Albini for writing this advisory; Emily Albini, Josh Stone and Manish Goregaokar for coordinating the disclosure; Ed Page and Eric Huss for advising during the disclosure.

    33. ๐Ÿ”— Rust Blog Security Advisory for Cargo (CVE-2026-5222) rss

      The Rust Security Response Team was notified that Cargo incorrectly normalized the URLs of third-party registries using the sparse index protocol. If a hosting provider allowed multiple registries to be hosted with arbitrary names within the same domain, an attacker able to publish crates in a registry could obtain the credentials of others users of the same registry.

      This vulnerability is tracked as CVE-2026-5222. The severity of the vulnerability is low , due to the extremely niche requirements needed to achieve the attack.

      Overview

      Originally Cargo only supported storing a registry's index within git repositories. Most git hosting solutions allow accessing a git repository with or without the .git suffix, so Cargo mirrored this behavior when normalizing registry URLs. This allowed credentials for https://example.com/index to be used for https://example.com/index.git.

      This normalization was unintentionally applied to the new sparse indexes too. Sparse indexes can be hosted on any HTTPS server, which treat URLs ending with .git as different URLs than those without the suffix.

      If the following conditions apply:

      • https://example.com/index is a sparse index.
      • https://example.com/index allows crates to depend on crates from any other registry.
      • The attacker is able to publish crates on https://example.com/index.
      • The attacker is able to upload arbitrary files to https://example.com/index.git.

      ...the attacker could configure https://example.com/index.git to be a Cargo sparse registry requiring authentication for downloads, and with a download URL pointing to a server recording any credentials set to it.

      When the attacker then publishes a crate foo to https://example.com/index depending on a crate bar from https://example.com/index.git, and tricks the victim into downloading foo, Cargo will think the two registries share the same credential and send the victim's Cargo token to the malicious registry.

      Mitigations

      Rust 1.96, to be released on May 28th, 2026, will update Cargo to only strip the .git suffix from registry URLs using the git protocol. No mitigations are available for users of older versions of Cargo.

      Affected versions

      All versions of Cargo shipped between Rust 1.68 (the stabilization of sparse registries) and 1.96 are affected.

      Acknowledgements

      We'd like to thank Christos Papakonstantinou for reporting this to us according to the Rust security policy.

      We also want to thank the members of the Rust project who helped us address the vulnerability: Arlo Siemens for developing the fix; Weihang Lo, Eric Huss and Emily Albini for reviewing the fix; Emily Albini for writing this advisory; Emily Albini, Josh Stone and Manish Goregaokar for coordinating the disclosure.

    34. ๐Ÿ”— Ampcode News GPT Image 2 Paints Better rss

      GPT Image 2 now powers Amp's painter tool.

      It is a better image editor than Gemini 3 Pro Image, particularly at preserving existing text, typography, and visual style when editing UI screenshots, at ~1/4th the price.

      Here's an example thread: Painter turned a screenshot of the Chronicle page into an updated design while keeping its existing visual style.

      A redesigned Amp Chronicle page generated by Painter with GPT Image 2