Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support loading blockwise quantized fp8 #1080

Merged
merged 1 commit into from
Jan 22, 2025
Merged

Conversation

EricLBuehler
Copy link
Owner

For DeepSeek V3!

Copy link

Code Metrics Report
  ===============================================================================
 Language            Files        Lines         Code     Comments       Blanks
===============================================================================
 C Header                2           35           28            0            7
 Dockerfile              1           41           22           10            9
 JSON                   12          105          104            0            1
 Python                 67         2824         2446           72          306
 Shell                   1           57           22           18           17
 Plain Text              3         3723            0         2413         1310
 TOML                   18          623          553            2           68
 YAML                    2           21           19            2            0
-------------------------------------------------------------------------------
 Jupyter Notebooks       4            0            0            0            0
 |- Markdown             2           77           32           31           14
 |- Python               2          205          178            1           26
 (Total)                            282          210           32           40
-------------------------------------------------------------------------------
 Markdown               45         3593            0         2732          861
 |- BASH                 6          103          100            0            3
 |- JSON                 1           12           12            0            0
 |- Python               7          121          109            0           12
 |- Rust                14          474          402            0           72
 |- TOML                 2           75           63            0           12
 (Total)                           4378          686         2732          960
-------------------------------------------------------------------------------
 Rust                  303        97808        87705         1976         8127
 |- Markdown           146         1676           25         1527          124
 (Total)                          99484        87730         3503         8251
===============================================================================
 Total                 458       108830        90899         7225        10706
===============================================================================
  

@EricLBuehler EricLBuehler merged commit df318df into master Jan 22, 2025
12 checks passed
@EricLBuehler EricLBuehler deleted the support_loading_fp8 branch January 22, 2025 12:53
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant