Overview of vLLM Optimization Techniques 【2024-10-26】An overview of vLLM optimization techniques, introducing PagedAttention and continuous batching solutions.
Improvements to the Legendary Fastest Terminal 【2023-02-06】Recently, I discovered Alacritty, a cross-platform terminal emulator powered by Rust and accelerated with OpenGL, merely around 5MB in size, touted as the fastest terminal. However, it's truly unattractive. I contemplated whether to revamp it to possibly make it my default terminal. The final product turned out quite well.