Ingest Log

[2026-05-09] ingest | LLaVA-1.5: Improved Baselines with Visual Instruction Tuning

[2026-05-09] ingest | CodeAct: Executable Code Actions Elicit Better LLM Agents

[2026-05-09] ingest | SatMAE: Pre-training Transformers for Temporal and Multi-Spectral Satellite Imagery

[2026-05-09] ingest | t2vec: Deep Representation Learning for Trajectory Similarity Computation

[2026-05-09] ingest | Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks

[2026-05-09] ingest | ColBERT: Efficient and Effective Passage Search via Contextualized Late Interaction

[2026-05-09] ingest | GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers

[2026-05-09] ingest | MTEB: Massive Text Embedding Benchmark

[2026-05-09] ingest | Word2Vec: Efficient Estimation of Word Representations in Vector Space

[2026-05-09] ingest | C-Pack / BGE: Packed Resources for General Chinese Embeddings

[2026-05-09] ingest | Phi-3 Technical Report

[2026-05-09] ingest | PyTorch FSDP: Fully Sharded Data Parallel

[2026-05-09] ingest | Megatron-LM: Training Multi-Billion Parameter Language Models

[2026-05-09] ingest | Orca: A Distributed Serving System for Transformer-Based Generative Models

[2026-05-09] ingest | FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning

[2026-05-09] ingest | ZeRO: Memory Optimizations Toward Training Trillion Parameter Models

[2026-05-09] ingest | On Calibration of Modern Neural Networks

[2026-05-09] ingest | Hidden Markov Map Matching Through Noise and Sparseness

  • Source: hidden-markov-map-matching-noise-sparseness
  • Reference: ACM SIGSPATIAL 2009 (Microsoft Research)
  • Key concepts: hidden-markov-models, map-matching, viterbi
  • Entities: microsoft-research
  • One-line takeaway: Modeling a GPS trace plus road network as an HMM (Gaussian emission in perpendicular distance, exponential transition in |d_GPS - d_road|) and decoding with Viterbi is the canonical map-matching algorithm — robust to sparse sampling and 50m+ noise — and underlies every modern snap-to-road system.

[2026-05-09] ingest | SAM 2: Segment Anything in Images and Videos

[2026-05-09] ingest | Qwen2.5-VL Technical Report

[2026-05-08] ingest | Flamingo: A Visual Language Model for Few-Shot Learning

[2026-05-07] ingest | Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection

[2026-05-06] ingest | RAG: Retrieval-Augmented Generation for Knowledge-Intensive NLP

[2026-05-05] ingest | Self-Rewarding Language Models

[2026-05-04] ingest | Training Compute-Optimal Large Language Models (Chinchilla)

[2026-05-04] ingest | AWQ: Activation-Aware Weight Quantization for LLM Compression and Acceleration

[2026-05-03] ingest | Emergent World Representations: Exploring a Sequence Model Trained on a Synthetic Task (Othello-GPT)

[2026-05-02] ingest | Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation

[2026-04-30] ingest | BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models

[2026-04-29] ingest | ORPO: Monolithic Preference Optimization without Reference Model

Entries are appended chronologically as sources are ingested.

[2026-04-28] ingest | Highly Accurate Protein Structure Prediction with AlphaFold

[2026-04-27] ingest | Gemini 1.5: Unlocking Multimodal Understanding Across Millions of Tokens of Context

[2026-04-26] ingest | Neural Machine Translation of Rare Words with Subword Units (BPE)

[2026-04-25] ingest | Learning to Summarize from Human Feedback

  • Source: learning-to-summarize-human-feedback
  • Key concepts: rlhf | reward-model | ppo | alignment | sft
  • One-line takeaway: Train a reward model on human pairwise comparisons, then use PPO to optimize the language model against it — this three-stage pipeline produces summaries humans prefer over SFT baselines and over models 30× larger.

[2026-04-24] ingest | Mixture of Depths: Dynamically Allocating Compute in Transformer-Based Language Models

[2026-04-24] ingest | Consensus Entropy: Harnessing Multi-VLM Agreement for Self-Verifying and Self-Improving OCR

[2026-04-23] ingest | DINOv2: Learning Robust Visual Features without Supervision

[2026-04-22] ingest | Distilling the Knowledge in a Neural Network

[2026-04-22] ingest | KTO: Model Alignment as Prospect Theoretic Optimization

[2026-04-21] ingest | GPT-4 Technical Report

[2026-04-21] ingest | Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

[2026-04-20] ingest | BART: Denoising Sequence-to-Sequence Pre-training

[2026-04-20] ingest | GQA: Grouped-Query Attention

[2026-04-19] ingest | Deep Residual Learning for Image Recognition

[2026-04-18] ingest | Evaluating Large Language Models Trained on Code (Codex)

[2026-04-18] ingest | Scalable Diffusion Models with Transformers (DiT)

[2026-04-18] ingest | Mistral 7B

[2026-04-17] ingest | LIMA: Less Is More for Alignment

[2026-04-17] ingest | Tree of Thoughts: Deliberate Problem Solving with Large Language Models

[2026-04-17] ingest | Self-Consistency Improves Chain of Thought Reasoning in Language Models

[2026-04-17] ingest | QLoRA: Efficient Finetuning of Quantized LLMs

[2026-04-17] ingest | Emerging Properties in Self-Supervised Vision Transformers (DINO)

[2026-04-17] ingest | Masked Autoencoders Are Scalable Vision Learners

[2026-04-17] ingest | A Simple Framework for Contrastive Learning of Visual Representations

[2026-04-17] ingest | Proximal Policy Optimization Algorithms

  • Source: proximal-policy-optimization
  • Key concepts: ppo | policy-gradient | reinforcement-learning | rlhf
  • One-line takeaway: Clipping the probability ratio in the surrogate objective to [1-ε, 1+ε] prevents destructively large policy updates — giving PPO TRPO-level sample efficiency at a fraction of the implementation complexity. The workhorse behind RLHF.

[2026-04-17] ingest | High-Resolution Image Synthesis with Latent Diffusion Models

[2026-04-17] ingest | Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer (T5)

[2026-04-17] ingest | Language Models are Unsupervised Multitask Learners

[2026-04-17] ingest | Constitutional AI: Harmlessness from AI Feedback

[2026-04-16] ingest | Whisper: Robust Speech Recognition via Large-Scale Weak Supervision

[2026-04-16] ingest | Toolformer: Language Models Can Teach Themselves to Use Tools

[2026-04-16] ingest | Switch Transformers: Scaling to Trillion Parameter Models with Sparse MoE

[2026-04-16] ingest | ReAct: Synergizing Reasoning and Acting in Language Models

[2026-04-16] ingest | RAG: Retrieval-Augmented Generation for Knowledge-Intensive NLP

[2026-04-16] ingest | LLaVA: Visual Instruction Tuning

[2026-04-16] ingest | GRPO: Group Relative Policy Optimization (DeepSeekMath)

[2026-04-16] ingest | DeepSeek-R1: Incentivizing Reasoning via Reinforcement Learning

[2026-04-16] ingest | Denoising Diffusion Probabilistic Models

[2026-04-16] ingest | Constitutional AI: Harmlessness from AI Feedback

  • Source: constitutional-ai-harmlessness-from-ai-feedback
  • Key concepts: alignment | rlhf | sft
  • One-line takeaway: 16 written principles plus an AI feedback loop (RLAIF) replace human red-teamers — the model self-critiques and revises its own harmful outputs, then trains a preference model from AI-generated comparisons.

[2026-04-16] ingest | Segment Anything

[2026-04-15] ingest | Scaling Laws for Neural Language Models

[2026-04-14] ingest | Adam: A Method for Stochastic Optimization

[2026-04-13] ingest | BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

[2026-04-11] ingest | An Image is Worth 16x16 Words (ViT)

[2026-04-10] ingest | Training language models to follow instructions with human feedback (InstructGPT)

[2026-04-09] ingest | CLIP: Learning Transferable Visual Models From Natural Language Supervision

[2026-04-08] ingest | Emergent Abilities of Large Language Models

[2026-04-06] ingest | RoPE: Enhanced Transformer with Rotary Position Embedding

[2026-04-04] ingest | Direct Preference Optimization: Your Language Model is Secretly a Reward Model

[2026-04-05] ingest | Attention Is All You Need

[2026-04-05] ingest | LoRA: Low-Rank Adaptation of Large Language Models

[2026-04-05] ingest | FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness

[2026-04-05] ingest | Efficient Memory Management for Large Language Model Serving with PagedAttention

[2026-04-05] ingest | Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

[2026-04-05] ingest | Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets

[2026-04-05] ingest | A Systematic Empirical Study of Grokking: Depth, Architecture, Activation, and Regularization

[2026-04-05] ingest | Mamba: Linear-Time Sequence Modeling with Selective State Spaces

[2026-04-05] ingest | Fast Inference from Transformers via Speculative Decoding

[2026-04-05] ingest | Falcon Perception: Early-Fusion Transformer for Open-Vocabulary Grounding and Segmentation

[2026-04-12] ingest | Splitwise: Efficient Generative LLM Inference Using Phase Splitting

[2026-04-17] stub | High-Resolution Image Synthesis with Latent Diffusion Models

[2026-04-17] stub | Scalable Diffusion Models with Transformers (DiT)

[2026-04-17] stub | Mistral 7B

  • Source: mistral-7b
  • ArXiv: https://arxiv.org/abs/2310.06825
  • One-line takeaway: Combining grouped-query attention and sliding window attention produces a 7B model that outperforms LLaMA 2 13B with lower inference cost.

[2026-04-17] stub | Distilling the Knowledge in a Neural Network

[2026-04-17] stub | QLoRA: Efficient Finetuning of Quantized LLMs

[2026-04-17] stub | Mixture of Depths: Dynamically Allocating Compute in Transformer LLMs

[2026-04-17] stub | LIMA: Less Is More for Alignment

[2026-04-17] stub | Learning to Summarize from Human Feedback

[2026-04-17] stub | Proximal Policy Optimization Algorithms

[2026-04-17] stub | KTO: Model Alignment as Prospect Theoretic Optimization

[2026-04-17] stub | Llama 2: Open Foundation and Fine-Tuned Chat Models

[2026-04-17] stub | GPT-4 Technical Report

[2026-04-17] stub | Self-Consistency Improves Chain of Thought Reasoning

[2026-04-17] stub | Tree of Thoughts: Deliberate Problem Solving with LLMs

[2026-04-17] stub | Deep Residual Learning for Image Recognition

[2026-04-17] stub | Masked Autoencoders Are Scalable Vision Learners

[2026-04-17] stub | A Simple Framework for Contrastive Learning of Visual Representations

[2026-04-17] stub | Emerging Properties in Self-Supervised Vision Transformers (DINO)

[2026-04-17] stub | DINOv2: Learning Robust Visual Features without Supervision