ML Wiki
Search
Search
Explorer
Tag: applications-systems
1 item with this tag.
Apr 12, 2026
Splitwise: LLM Inference at Half the Cost by Splitting Prompt and Decode
source
inference-systems
llm-serving
applications-systems