Search

EN 中文日本語

# Archives

Browse all posts by publish date.

2026

19 entries

Jun 22, 2026 Posts · Article 1 min read Let Claude Code Automatically Generate Images with Your ChatGPT Subscription: No API Key, and How It Gets Around Turnstile Let agents like Claude Code and Cursor generate images along the way when writing docs or READMEs, without an OPENAI_API_KEY, without paying extra, and using the ChatGPT subscription you already have. This article dissects the most interesting implementation path behind it: the web backend. Why you can’t just POST directly, why the real wall among the three layers of anti-scraping is the single-use Turnstile token, and the complete flow for driving your already-logged-in Chrome to generate images. Jun 18, 2026 Posts · Article 1 min read Let Claude Code Draw the Image Itself: The Original Intention and Principles Behind chatgpt-imagegen When an AI agent needs an image while working, the traditional path either requires an API key and money, or a human has to go to ChatGPT, generate the image, and paste it back—the agent can only get stuck waiting. chatgpt-imagegen lets the agent generate images itself using your existing ChatGPT subscription: no API key required, no Codex quota consumed by default, and support for image-to-image generation. This article explains its original intention, how its two backends work, and why it is designed for agents. Jun 17, 2026 Posts · Article 1 min read Letting an Agent Click Into Cross-Origin Iframes: How chrome-use Took On This Hard Problem When connecting an AI agent to a browser, the hardest part isn’t opening pages. It’s the forms hidden inside cross-origin iframes: the agent can read them and fill them in, but it just can’t click “Save.” Here’s the story of how we tackled that hard problem. Jun 16, 2026 Posts · Article 1 min read Reversing a Continuously Hardened Target: Why Version Adaptation Is Getting Harder With the same macOS IM client, each upgrade makes reverse engineering more costly. This is a record of the step up from a “hot update” to a “minor version,” and why the old path of purely scanning memory has collectively failed in newer versions: what can be reused across versions is never hardcoded coordinates, but structural invariants and dynamic observation. Jun 12, 2026 Posts · Article 1 min read Racing Against WeChat Versions: The Adaptation History of a Local Tool Turning WeChat on macOS into a local interface for AI agents is not hardest the first time it works; the hard part is keeping it alive after WeChat updates again and again. This covers three generations of methods for obtaining the database key, how background message sending changed its routing approach, and why what truly carries across versions is the method for finding things, not any specific address. Jun 11, 2026 Posts · Article 1 min read iphone-use — Computer-use, but for the iPhone An open-source agent control layer for a real iPhone: live browser remote + a see/act HTTP API for AI agents. Three input layers (Shortcut verbs / element tree / pixel fallback), clean CJK input, hardware-validated. May 29, 2026 Posts · Article 1 min read Is the `cache 4m23s` Line in the Status Bar Actually Accurate? There’s a prompt-cache countdown line in the status bar I wrote for Claude Code. Where it anchors, how the formula works, and when it lies to you — let’s pull apart the source code alongside a real transcript. May 27, 2026 Posts · Article 13 min read From Soft Methods to Hard Patches — A Methodological Review of a macOS Mach-O Reverse Engineering Session Methodological notes from an 8-hour macOS binary reverse engineering session — why soft methods fail, when you have to switch to hex patching, how Ghidra/lldb/llvm-objdump work together, and how a “killer patch” mindset can be used in offensive and defensive drills. No specific target is disclosed; this only covers methods, tools, and troubleshooting techniques. May 26, 2026 Posts · Article 3 min read Facing AI Audits: How Can You “Legalize” Your Reverse Engineering Project? Examines a practical approach to handling decompilation projects by combining Claude Code and Gemini CLI, focusing on the challenge of ownership verification and resolving it through an automated sanitization process. May 12, 2026 Posts · Article 1 min read Dissecting the Implementation of the /goal Command Based on the Claude Code 2.1.139 headless source code, this breaks down how the /goal command drives the Agent to keep working until a condition is met through a session-level Stop hook. May 12, 2026 Posts · Article 1 min read claude agents and /bg: An Analysis of the Agent View Implementation Based on the Claude Code 2.1.139 source code, this article breaks down how claude agents, --bg, and /bg share the background job model, daemon, and FleetView TUI. Mar 23, 2026 Posts · Article 8 min read How I Verified the Local Database Unlock Chain in WeChat macOS 4.0.1.52 A retrospective on engineering-focused forensics conducted on my own device. By tracing static files, runtime open paths, and SQLCipher parameters in sequence, I ultimately confirmed that the local database unlock chain in WeChat macOS 4.0.1.52 is based on an account-level key distribution model. Mar 19, 2026 Posts · Article 1 min read If You Want to Automate Your Own WeChat Account, First Think Through These Three Layers If the goal is “automation for your own account,” what really needs to be separated first is not a feature checklist, but the listening layer, execution layer, and storage layer. Mar 18, 2026 Posts · PPT 1 min read From Assisted Coding to Integrated R&D: Putting AI Programming into Team Practice (PPT) A Slidev-style Chinese presentation translated into English, explaining Agentic Coding, Skills and MCP, AGENTS.md and CLAUDE.md, sandboxing and permission controls, and how a team can truly operationalize AI programming with Plans, a YApi Skill, and docs-sync. Mar 11, 2026 Posts · Article 1 min read How QClaw Works: How It Turns OpenClaw into a Desktop Application This article goes beyond the broad “control plane / execution plane” framing and instead walks through the current implementation piece by piece—covering the bridging layer, IPC, configuration fields, the WeChat flow, rollback mechanisms, and the evidence index—to explain how QClaw organizes OpenClaw into a deliverable desktop runtime. Mar 9, 2026 Posts · Article 1 min read OpenClaw v2026.3.8 Release Analysis: Backup, Remote Gateway, Talk Mode, and Multi-Endpoint Routing Continue to Improve Based on the official release notes, this article summarizes the key changes in OpenClaw v2026.3.8: backup commands arriving in the main workflow, macOS remote gateway onboarding, Talk silence timeout, Brave search integration, ACP receipts, and multi-platform routing fixes. Mar 5, 2026 Posts · Article 1 min read Designing Browser Automation Offense/Defense: Detection Models and a Layered Control Plane This article abstracts browser automation in highly adversarial environments into a multidimensional risk scoring system, and builds a layered control plane around three core dimensions: consistency, rarity, and temporal distribution. Mar 5, 2026 Posts · Article 1 min read Designing Attack and Defense Strategies for Cloudflare Turnstile: System Principles and the Control Plane Reframes the Turnstile risk model from a capability-token perspective, focusing on issuance/consumption semantics, scope binding, and execution integrity, and provides a defense prioritization for high-adversary environments. Mar 3, 2026 Posts · Article 1 min read Use agent-browser-stealth Instead of agent-browser For growth and promotion use cases: improve AI browser operability on high-risk-control sites like Amazon, and support reusing a user’s existing browser state.