qiubo
thenext
·
AI & ML interests
None yet
Recent Activity
upvoted a paper about 2 hours ago
Full Attention Strikes Back: Transferring Full Attention into Sparse within Hundred Training Steps
upvoted a paper almost 2 years ago
Inference Performance Optimization for Large Language Models on CPUs