Travis King's picture

In a Training Loop 🔄

Travis King

travisking

·

AI & ML interests

have you heard of generative AI?

Recent Activity

upvoted a collection about 5 hours ago

SWE-FastContext

upvoted a collection 4 days ago

Gemma 4 QAT Q4_0

liked a dataset 6 days ago

agents-last-exam/agents-last-exam

View all activity

Organizations

None yet

upvoted a collection about 5 hours ago

SWE-FastContext

A family of code-search models powering the Explore subagent for coding agents. • 2 items • Updated 2 days ago • 9

upvoted a collection 4 days ago

Gemma 4 QAT Q4_0

19 items • Updated 11 days ago • 127

upvoted a paper 6 days ago

Agents' Last Exam

Paper • 2606.05405 • Published 13 days ago • 344

upvoted a collection 13 days ago

Nemotron-Post-Training-v3

Collection of datasets used in the post-training phase of Nemotron Nano, Super, and Ultra v3. • 50 items • Updated 4 days ago • 157

upvoted a paper 14 days ago

NITP: Next Implicit Token Prediction for LLM Pre-training

Paper • 2605.24956 • Published 23 days ago • 35

upvoted 3 papers 15 days ago

LongDS-Bench: On the Failure of Long-Horizon Agentic Data Analysis

Paper • 2605.30434 • Published 19 days ago • 23

COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation

Paper • 2605.31264 • Published 18 days ago • 112

Skill0.5: Joint Skill Internalization and Utilization for Out-of-Distribution Generalization in Agentic Reinforcement Learning

Paper • 2605.28424 • Published 20 days ago • 32

upvoted an article 28 days ago

Article

Introducing the Ettin Reranker Family

tomaarsen

•

28 days ago

• 51

upvoted a collection 28 days ago

Toto-2.0

5 items • Updated May 11 • 35

upvoted 2 collections about 1 month ago

Granite Speech

Multilingual ASR and speech-to-text (STT) models for enterprise transcription and translation. • 7 items • Updated Apr 29 • 34

Granite Embedding

Embedding models (bi‑encoders and rerankers) for RAG, semantic search, and retrieval tasks. • 9 items • Updated Apr 30 • 45

upvoted 3 papers about 1 month ago

Retrieval from Within: An Intrinsic Capability of Attention-Based Models

Paper • 2605.05806 • Published May 8 • 7

Geometry Conflict: Explaining and Controlling Forgetting in LLM Continual Post-Training

Paper • 2605.09608 • Published May 10 • 53

Efficient Pre-Training with Token Superposition

Paper • 2605.06546 • Published May 7 • 46

upvoted a collection about 2 months ago

Granite 4.1 Language Models

Efficient language models for multilingual generation, coding, RAG, and AI assistant workflows. • 6 items • Updated Apr 29 • 59

upvoted an article about 2 months ago

Article

Granite 4.1 LLMs: How They’re Built

ibm-granite

•

Apr 29

• 81

upvoted 2 collections 2 months ago

Olmo 3

Artifacts for the Olmo 3 release. • 7 items • Updated Mar 2 • 171

Gemma 4

15 items • Updated 5 days ago • 969

upvoted a paper 4 months ago

EuroLLM-22B: Technical Report

Paper • 2602.05879 • Published Feb 5 • 3