fix: improve request viewer and concurrency

2026-04-29 21:40:22 +08:00
parent 5cc282eb14
commit 5c6754c6ae
10 changed files with 395 additions and 30 deletions
--- a/README.md
+++ b/README.md
@@ -8,7 +8,7 @@ The project is designed for tools such as Claude Code, Cline, Continue, OpenCode

 ## Current Version

-The current desktop line is `v1.2.0`.
+The current desktop line is `v1.2.1`.

 Release builds are produced by GitHub Actions for:

@@ -249,6 +249,21 @@ Priority order:
 4. command-line flags
 5. desktop Settings page updates

+## Concurrency
+
+Older builds rejected concurrent chat requests with a `rate_limit_error` saying the proxy handled one request at a time. Current builds use a small execution pool instead:
+
+- default max concurrent chat requests: `4`
+- override with `LINGMA_PROXY_MAX_CONCURRENT`
+- allowed range: `1` to `16`
+- `session_mode=auto` uses fresh Lingma sessions so parallel editor requests do not share one sticky session
+
+Example:
+
+```bash
+LINGMA_PROXY_MAX_CONCURRENT=8 lingma-ipc-proxy --port 8095
+```
+
 ## Function Calling / Tool Calling

 Lingma does not expose a native public OpenAI/Anthropic tool-call protocol, so this proxy emulates tool calling:
@@ -291,7 +306,7 @@ The desktop bundle name is always `Lingma IPC Proxy`.

 The release workflow is triggered by:

- pushing a tag such as `v1.2.0`
+- pushing a tag such as `v1.2.1`
 - manually running the `Release` workflow with a tag input

 Planned improvements: