Pull requests: intel/auto-round
- #1320: add blockwise way in pure RTN to ease module replacements (opened Jan 22, 2026 by wenhuach21; 18 tasks)
- #1317: [GGUF] use quant_nontext_module to control whether to quantize the vision model (opened Jan 22, 2026 by n1ck-guo; 8 of 18 tasks)
- #1307: Delay materializing the replaced model weights until quantization (opened Jan 21, 2026 by yiliu30; 5 of 6 tasks)
- #1295: Optimize FP8 layer conversion by skipping weight initialization (opened Jan 16, 2026 by Copilot)
- #1289: Robust FP8 layer detection for ignore_layers (#1283) (opened Jan 15, 2026 by scopophobic)
- #1286: Fix ignore_layers not working for FP8 models (opened Jan 15, 2026 by Copilot; 11 tasks done)
- #1278: [WIP][refactor quantizers][step 1] refactor RTN and tuning (opened Jan 14, 2026 by n1ck-guo)