Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
RedHatAI
/
quantization
like
6
Follow
Red Hat AI
1.84k
kernel
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
3
main
quantization
/
cutlass_w8a8
/
c3x
56.9 kB
2 contributors
History:
2 commits
danieldk
HF Staff
Sync to vLLM 20250627
8aa00a3
7 months ago
cutlass_gemm_caller.cuh
3.98 kB
Sync on vLLM 20240402
9 months ago
scaled_mm.cuh
5.39 kB
Sync on vLLM 20240402
9 months ago
scaled_mm_azp_sm90_int8.cu
1.04 kB
Sync on vLLM 20240402
9 months ago
scaled_mm_blockwise_sm100_fp8.cu
862 Bytes
Sync to vLLM 20250627
7 months ago
scaled_mm_blockwise_sm100_fp8_dispatch.cuh
11.5 kB
Sync to vLLM 20250627
7 months ago
scaled_mm_blockwise_sm90_fp8.cu
854 Bytes
Sync to vLLM 20250627
7 months ago
scaled_mm_blockwise_sm90_fp8_dispatch.cuh
7.71 kB
Sync on vLLM 20240402
9 months ago
scaled_mm_helper.hpp
3.54 kB
Sync to vLLM 20250627
7 months ago
scaled_mm_kernels.hpp
2.28 kB
Sync to vLLM 20250627
7 months ago
scaled_mm_sm100_fp8.cu
980 Bytes
Sync on vLLM 20240402
9 months ago
scaled_mm_sm100_fp8_dispatch.cuh
5.6 kB
Sync to vLLM 20250627
7 months ago
scaled_mm_sm90_fp8.cu
972 Bytes
Sync on vLLM 20240402
9 months ago
scaled_mm_sm90_fp8_dispatch.cuh
4.69 kB
Sync on vLLM 20240402
9 months ago
scaled_mm_sm90_int8.cu
980 Bytes
Sync on vLLM 20240402
9 months ago
scaled_mm_sm90_int8_dispatch.cuh
6.52 kB
Sync on vLLM 20240402
9 months ago