Pull requests: quic/efficient-transformers

Pull requests list

Porting fp16/bf16 support to release/v1.21.6
#961 opened May 5, 2026 by asmigosw Contributor Draft
[QEff. Finetuning] TP+DDP for transformers upgrade to v5.5.4
#960 opened May 4, 2026 by smedhe Contributor
kimi-k2-minor-fix
#959 opened May 4, 2026 by quic-mamta Contributor
Enable ffn blocking for dense models with automatic blocking configurator (labels: enhancement, qeff.blocking)
#958 opened May 4, 2026 by kdulla Contributor
Optimize attention blocking nested loops
#957 opened Apr 30, 2026 by anujgupt-github Contributor
Layer wise changes for kimi model
#954 opened Apr 29, 2026 by abhishek-singh591 Contributor
[Nightly CI]: Creating separate Pipeline for Nightly Jobs
#953 opened Apr 29, 2026 by abukhoy Contributor
MLA Int4 changes
#949 opened Apr 28, 2026 by quic-mamta Contributor
First Block Caching Infra for diffusers (label: Diffusers)
#941 opened Apr 24, 2026 by quic-amitraj Contributor
feat(moe): NSP-blocked expert dispatch for Qwen3MOE and GPT-OSS prefill (label: enhancement)
#935 opened Apr 21, 2026 by vbaddi Contributor
Added MDP generation to QEff Compile
#930 opened Apr 21, 2026 by quic-mohmeh
Enabled Qwen3-VL embedding model
#923 opened Apr 20, 2026 by quic-amitraj Contributor
[Qwen3_Omni]_Onboarding
#922 opened Apr 20, 2026 by mohiso22 Contributor Draft
Enabling support of rerankers models 2B and 8B of qwen3vl
#921 opened Apr 18, 2026 by quic-amitraj Contributor
feat: Enable benchmark-mode module inventory/export across all CausalLM architectures (label: enhancement)
#906 opened Apr 3, 2026 by vbaddi Contributor
qwen3_5_linear_attn
#901 opened Apr 1, 2026 by mohiso22 Contributor Draft