ML Wiki

Tag: applications-systems

1 item with this tag.

  • Apr 12, 2026

    Splitwise: LLM Inference at Half the Cost by Splitting Prompt and Decode

    • source
    • inference-systems
    • llm-serving
    • applications-systems