DPO/SFT-tuned language models on Politune
Jonas Golde
whoisjones
AI & ML interests
Data-efficient transfer learning
Recent Activity
whoisjones updated a collection about 1 month ago: MastermindEval
whoisjones updated a collection about 1 month ago: MastermindEval
whoisjones updated a collection about 1 month ago: Politune
Organizations
Models 24
whoisjones/politune-qwen3-8b-right-dpo
Text Generation • Updated • 2
whoisjones/politune-qwen3-8b-right-sft
Text Generation • Updated • 3
whoisjones/politune-qwen3-8b-left-dpo
Text Generation • Updated • 2
whoisjones/politune-qwen3-8b-left-sft
Text Generation • Updated • 2
whoisjones/politune-mistral-7b-right-dpo
Text Generation • Updated • 2
whoisjones/politune-mistral-7b-right-sft
Text Generation • Updated • 4
whoisjones/politune-mistral-7b-left-dpo
Text Generation • Updated • 1
whoisjones/politune-mistral-7b-left-sft
Text Generation • Updated • 3
whoisjones/politune-llama3-8b-right-dpo
Text Generation • Updated • 4
whoisjones/politune-llama3-8b-right-sft
Text Generation • Updated • 4
Datasets 29
whoisjones/finerweb_document_context
Updated • 37
whoisjones/sudoku
Viewer • Updated • 1.42M • 27
whoisjones/maze
Viewer • Updated • 9k • 8
whoisjones/multinerd
Viewer • Updated • 1.67M • 270
whoisjones/masakhaner
Viewer • Updated • 153k • 606 • 1
whoisjones/uner
Viewer • Updated • 66.8k • 75
whoisjones/fiNERweb
Viewer • Updated • 3.98M • 149 • 9
whoisjones/fiNERweb-x
Updated • 1.71k
whoisjones/fiNERweb-x-multi
Updated • 8
whoisjones/fiNERweb-gemma-x-multi
Updated • 135