3b56d7a11f [BugFix] DSV4 Initialize KV store for Decode Node after first real request (#9793)

### What this PR does / why we need it

This change is based on the release branch version, where external KV stores already initialize lazily from put(). The put-only path misses decode-only / pure consumer workers, because they can run a real request without ever storing KV.

This PR keeps the existing put-triggered initialization for producer/store paths and adds a post-real-forward fallback:

- start_load_kv() marks the current connector step as a real forward only when forward_context is not None.
- get_finished() initializes the backend store afterward if needed.
- The no-forward path also calls get_finished(), so it explicitly resets the current-step marker and does not initialize the store there.

Lazy init changes only the timing of initialization: once initialization is actually triggered, initialization failures still raise as fatal errors. This avoids initializing in connector construction, KV cache registration, or no-forward cleanup, while still covering workers that do not call put().

vLLM version: v0.20.1
vLLM main: vllm-project/vllm@c7aa186

---------

Signed-off-by: Pz1116 <zpbzpb123123@gmail.com>
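
The initialization flow described in #9793 can be illustrated with a minimal sketch. This is not the code from the PR: the class `LazyStoreConnector`, the `store_factory` argument, the `no_forward_step()` helper, and the `initialized` property are hypothetical names introduced here, and the method signatures are simplified relative to the real connector interface. Only the names `put()`, `start_load_kv()`, `get_finished()`, and `forward_context` come from the commit message.

```python
from typing import Any, Callable, Optional


class LazyStoreConnector:
    """Hypothetical connector showing put-triggered and post-forward lazy init."""

    def __init__(self, store_factory: Callable[[], Any]) -> None:
        # Construction never touches the backend store.
        self._store_factory = store_factory
        self._store: Optional[Any] = None
        self._real_forward_this_step = False

    @property
    def initialized(self) -> bool:
        return self._store is not None

    def _ensure_store(self) -> Any:
        # Lazy init only changes timing: a failing factory still raises here.
        if self._store is None:
            self._store = self._store_factory()
        return self._store

    def put(self, key: str, value: Any) -> None:
        # Producer/store path: the first put() initializes the store.
        self._ensure_store()[key] = value

    def start_load_kv(self, forward_context: Optional[Any]) -> None:
        # Mark the step as a real forward only when a context is present.
        self._real_forward_this_step = forward_context is not None

    def get_finished(self) -> None:
        # Consume the marker, then initialize only after a real forward.
        was_real_forward = self._real_forward_this_step
        self._real_forward_this_step = False
        if was_real_forward:
            self._ensure_store()

    def no_forward_step(self) -> None:
        # No-forward cleanup: reset the marker first so get_finished()
        # cannot initialize the store on this path.
        self._real_forward_this_step = False
        self.get_finished()


if __name__ == "__main__":
    consumer = LazyStoreConnector(dict)  # a plain dict stands in for the backend
    consumer.no_forward_step()
    print(consumer.initialized)  # store untouched after no-forward cleanup
    consumer.start_load_kv(forward_context=object())
    consumer.get_finished()
    print(consumer.initialized)  # store created after the first real forward
```

In this sketch a decode-only worker that never calls `put()` still gets a store after its first real forward, while construction and no-forward cleanup leave the store uninitialized.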
3ead473ecd [Doc][Misc] Update v0.20.2rc README information (#10103)

### What this PR does / why we need it?

Update README and docs version references for the v0.20.2rc release candidate.

- Add v0.20.2rc1 release candidate news to English and Chinese README files.
- Add releases/v0.20.2rc to the maintained branch tables.
- Update docs header and Sphinx release metadata for v0.20.2rc.

### Does this PR introduce _any_ user-facing change?

No. Documentation-only update.

### How was this patch tested?

- `git diff --check`
- `bash format.sh ci` partially passed; failed only because `shellcheck` is not installed in the local environment. Other hooks passed, including ruff, codespell, typos, clang-format, markdownlint, GitHub Actions workflow lint, PNG export lint, filename-space check, Python package `__init__.py` check, forbidden logger/import checks, boolean-op check, and suggestion check.

---------

Signed-off-by: weijinqian_v1 <weijinqian@huawei.com>
Co-authored-by: weijinqian_v1 <weijinqian@huawei.com>
6363175512 [Doc][Misc] Update release notes known issues (#10039)

### What this PR does / why we need it?

Update v0.20.2rc1 release notes known issues and add the DeepSeek V4 KV Pool known issue reference.

- git diff --check
- vLLM version: v0.20.2
- vLLM main: https://github.com/vllm-project/vllm/commit/9090368b650896bf5fc990c921df7eb4c20355a5

Signed-off-by: weijinqian_v1 <weijinqian@huawei.com>
Co-authored-by: weijinqian_v1 <weijinqian@huawei.com>
(cherry picked from commit 6230b36103676268fc8b28ec397eb0af41fe3c7c)
3790531219 [Doc][Misc] Prepare v0.20.2rc1 release notes (#9602)

### What this PR does / why we need it?

This PR prepares the v0.20.2rc1 release documentation set. It adds the new release notes entry for `v0.20.2rc1` and updates the main branch documentation references so the latest RC entry, FAQ link, and version matrix all point to the current release candidate.

Related release tracking:

- Release checklist: #9591
- Feedback issue: #9586

### Does this PR introduce _any_ user-facing change?

No. This is a documentation-only update for the v0.20.2rc1 release process.

### How was this patch tested?

- Reviewed the staged markdown and version substitutions with `git diff` and `rg`
- Ran `python -m py_compile docs/source/conf.py`
- Full Sphinx build was not run locally because the current environment does not have `docutils` installed
- vLLM version: v0.20.2
- vLLM main: https://github.com/vllm-project/vllm/commit/9090368b650896bf5fc990c921df7eb4c20355a5

---------

Signed-off-by: weijinqian_v1 <weijinqian@huawei.com>
Co-authored-by: weijinqian_v1 <weijinqian@huawei.com>
(cherry picked from commit 078fae3f10bceb8fe75060167f334110c20bdaf1)