Pull requests: quic/efficient-transformers
Pull requests list
[QEff. Finetuning] TP+DDP for transformers upgrade to v5.5.4
#960
opened May 4, 2026 by
smedhe
Contributor
Enable ffn blocking for dense models with automatic blocking configurator
enhancement
New feature or request
qeff.blocking
#958
opened May 4, 2026 by
kdulla
Contributor
Optimize attention blocking nested loops
#957
opened Apr 30, 2026 by
anujgupt-github
Contributor
Layer wise changes for kimi model
#954
opened Apr 29, 2026 by
abhishek-singh591
Contributor
[Nightly CI]: Creating separate Pipeline for Nightly Jobs
#953
opened Apr 29, 2026 by
abukhoy
Contributor
fix: improve weight offloading to handle plain tensor attrs and use to_empty()
#952
opened Apr 28, 2026 by
quic-rishinr
Contributor
Add SplitTensorsTransform to QEFFAutoModel to prevent >2GB protobuf export issue
#950
opened Apr 28, 2026 by
quic-rishinr
Contributor
First Block Caching Infra for diffusers
Diffusers
Use for PR related to diffusers in efficient-transformers.
#941
opened Apr 24, 2026 by
quic-amitraj
Contributor
feat(moe): NSP-blocked expert dispatch for Qwen3MOE and GPT-OSS prefill
enhancement
New feature or request
#935
opened Apr 21, 2026 by
vbaddi
Contributor
updated blocking in diffusers with cross attention check instead of SL
#932
opened Apr 21, 2026 by
tv-karthikeya
Contributor
CB Bug fix for Qwen3VL Dense and basic cleaning of example script and Model File
#926
opened Apr 20, 2026 by
qcdipankar
Contributor
Enabling support of rerankers models 2B and 8B of qwen3vl
#921
opened Apr 18, 2026 by
quic-amitraj
Contributor
Removed redundancies from QEFFHybridCache and QEFFHybridChunkedCache
#914
opened Apr 13, 2026 by
quic-mamta
Contributor
Draft
revert(export): Revert proxy-only ONNX transform gating and restore default export behavior
1.21.0
#912
opened Apr 10, 2026 by
vbaddi
Contributor
feat: Enable benchmark-mode module inventory/export across all CausalLM architectures
enhancement
New feature or request
#906
opened Apr 3, 2026 by
vbaddi
Contributor