S1.1
updated
Preview
• Updated • 680
• 92
argilla/intel-orca-dpo-pairs-helm-instruct
Viewer
• Updated • 5 • 12
• 1
argilla/OpenHermes2.5-dpo-binarized-alpha
Viewer
• Updated • 9.79k • 55
• 63
argilla/ultrafeedback-critique
Viewer
• Updated • 253k • 38
• 4
argilla/ultrafeedback-binarized-preferences-cleaned
Viewer
• Updated • 60.9k • 14.9k
• 162
CohereLabs/BinaryVectorDB
Updated • 220
• 8
ai2lumos/lumos_maths_plan_onetime
Viewer
• Updated • 19.8k • 53
• 2
ai2lumos/lumos_unified_plan_iterative
Viewer
• Updated • 55.4k • 66
• 2
ai2lumos/lumos_complex_qa_plan_onetime
Viewer
• Updated • 19.4k • 78
• 3
Viewer
• Updated • 10k • 95
• 30
lmsys/mt_bench_human_judgments
Viewer
• Updated • 5.76k • 2.46k
• 144
lmsys/chatbot_arena_conversations
Viewer
• Updated • 33k • 61.1k
• 463
vicgalle/configurable-system-prompt-multitask
Viewer
• Updated • 1.95k • 48
• 29
paraloq/json_data_extraction
Viewer
• Updated • 484 • 209
• 33
Viewer
• Updated • 479 • 138
• 6
iamtarun/python_code_instructions_18k_alpaca
Viewer
• Updated • 18.6k • 26.2k
• 345
LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement
Paper
• 2403.15042
• Published • 27
Viewer
• Updated • 2.35k • 14
• 1
Paper
• 2402.12219
• Published • 17
Viewer
• Updated • 20.2k • 164
• 38
M4-ai/prm_dpo_pairs_cleaned
Viewer
• Updated • 7.99k • 69
• 11
SanjiWatsuki/Kunoichi-DPO-v2-7B
Text Generation
• 7B • Updated • 231
• • 89
Viewer
• Updated • 17.3k • 979
• 36
mlabonne/orpo-dpo-mix-40k
Viewer
• Updated • 44.2k • 1.19k
• 302
Viewer
• Updated • 529k • 5.87k
• 190
Viewer
• Updated • 149k • 47
• 7
FreedomIntelligence/evol-instruct-hindi
Viewer
• Updated • 59k • 63
• 2
totally-not-an-llm/EverythingLM-data-V3
Viewer
• Updated • 1.07k • 54
• 32
RUCAIBox/Story-Generation
Updated • 55
• 13
Viewer
• Updated • 49.6k • 3.99k
• 179
Norquinal/claude_multiround_chat_30k
Viewer
• Updated • 32.2k • 144
• 71
Norquinal/claude_multi_instruct_30k
Viewer
• Updated • 32.2k • 23
• 10
Viewer
• Updated • 1.72M • 30
• 9
Locutusque/OpenCerebrum-2.0-SFT
Viewer
• Updated • 6.4k • 52
• 6
Locutusque/OpenCerebrum-2.0-DPO
Viewer
• Updated • 720 • 48
• 6
Preview
• Updated • 584
• 11
Preview
• Updated • 61
• 28
Viewer
• Updated • 1.46M • 43
• 15
Viewer
• Updated • 21.4k • 4.53k
• 452
nvidia/Nemotron-4-340B-Reward
Updated • 42
• 127
Magpie-Align/Magpie-Pro-MT-300K-v0.1
Viewer
• Updated • 300k • 1.27k
• 32
nvidia/Aegis-AI-Content-Safety-Dataset-1.0
Viewer
• Updated • 12k • 2.76k
• 60
Salesforce/xlam-function-calling-60k
Viewer
• Updated • 60k • 35.6k
• 633
Viewer
• Updated • 21.9M • 3.18k
• 727
diwank/llmlingua-compressed-text
Viewer
• Updated • 222k • 51
• 2
diwank/python-code-execution-output
Viewer
• Updated • 3.61k • 39
• 1
GUI Odyssey: A Comprehensive Dataset for Cross-App GUI Navigation on
Mobile Devices
Paper
• 2406.08451
• Published • 26
Viewer
• Updated • 99.5k • 3.23k
• 29
Viewer
• Updated • 327 • 36
• 13
Viewer
• Updated • 728 • 22
• 9
HannahRoseKirk/prism-alignment
Viewer
• Updated • 77.9k • 1.16k
• 102
RLHFlow/ArmoRM-Llama3-8B-v0.1
Text Classification
• 8B • Updated • 23.9k
• 184
sfairXC/FsfairX-LLaMA3-RM-v0.1
Text Classification
• 8B • Updated • 991
• 60
PKU-Alignment/PKU-SafeRLHF-30K
Viewer
• Updated • 29.9k • 1.91k
• 13
instruction-pretrain/ft-instruction-synthesizer-collection
Viewer
• Updated • 249k • 327
• 63
Viewer
• Updated • 68.1k • 42.7k
• 35
Viewer
• Updated • 12.7k • 27
• 5
imbue/human_question_quality_judgments
Viewer
• Updated • 167k • 9
• 9
Viewer
• Updated • 54k • 69
• 21
imbue/high_quality_public_evaluations
Viewer
• Updated • 12.8k • 18
• 6
imbue/high_quality_private_evaluations
Viewer
• Updated • 10.6k • 35
• 8
Text Generation
• 27B • Updated • 4.04k
• • 209
Viewer
• Updated • 1.46M • 819
• 4
Viewer
• Updated • 375k • 6.19k
• 772
Scaling Synthetic Data Creation with 1,000,000,000 Personas
Paper
• 2406.20094
• Published • 107
Viewer
• Updated • 1.24M • 373
• 8
Viewer
• Updated • 1.25M • 746
• 5
Viewer
• Updated • 2.05M • 416
• 3
Viewer
• Updated • 326k • 56
• 8
Updated • 766k
• 62
Updated • 1.11k
• 12
Updated • 642
• 11
Image-Text-to-Text
• 34B • Updated • 15
• 89
Image-Text-to-Text
• 7B • Updated • 216k
• 202
gokaygokay/random_instruct_docci
Viewer
• Updated • 14.6k • 328
• 6
Text Generation
• 8B • Updated • 3.15k
• 18
Gryphe/Opus-WritingPrompts
Viewer
• Updated • 6.02k • 383
• 80
Viewer
• Updated • 14.9k • 856
• 43
Viewer
• Updated • 3k • 45
• 13
Are You Sure? Rank Them Again: Repeated Ranking For Better Preference
Datasets
Paper
• 2405.18952
• Published • 10
Image-Text-to-Text
• 4B • Updated • 60.3k
• 56
OpenGVLab/InternVL2-Llama3-76B
Image-Text-to-Text
• 76B • Updated • 249
• 213
QuasarResearch/apollo-preview-v0.2
Viewer
• Updated • 15.4k • 39
• 9
Viewer
• Updated • 51.4k • 84
• 80
fireworks-ai/nexus_parallel_messages
Viewer
• Updated • 70 • 34
• 6
fireworks-ai/nexus_parallel_functions
Viewer
• Updated • 29 • 23
• 4
Viewer
• Updated • 539 • 42
• 27
Viewer
• Updated • 18.6k • 837
• 7
Viewer
• Updated • 259 • 303
• 2
Viewer
• Updated • 486k • 123
• 64
Viewer
• Updated • 1.75M • 126
• 105
Viewer
• Updated • 860k • 58.1k
• 589
Gryphe/Sonnet3.5-SlimOrcaDedupCleaned
Viewer
• Updated • 181k • 83
• 92
chargoddard/WebInstructSub-prometheus
Viewer
• Updated • 2.39M • 200
• 25
Viewer
• Updated • 1.96k • 40
• 30
Viewer
• Updated • 294k • 27
• 33
chargoddard/chai-feedback-pairs
Viewer
• Updated • 30.1k • 18
• 5
nayohan/multi_session_chat
Viewer
• Updated • 23.4k • 542
• 7
nvidia/Mistral-NeMo-12B-Instruct
Updated • 232
• 177
nvidia/Mistral-NeMo-12B-Base
Updated • 63
• 42
Text Generation
• 8B • Updated • 1.3M
• • 2.25k
meta-llama/Prompt-Guard-86M
Text Classification
• 0.3B • Updated • 376k
• • 339
Viewer
• Updated • 6.41k • 98
• 38
mistralai/Mistral-Large-Instruct-2407
123B • Updated • 5.88k
• 861
Symbol-LLM/Symbolic_Collection
Viewer
• Updated • 975k • 70
• 13
Viewer
• Updated • 100k • 17.6k
• 272
roborovski/dolly-entity-extraction
Viewer
• Updated • 5.95k • 22
• 2
kalomaze/Opus_Instruct_25k
Viewer
• Updated • 25.1k • 79
• 37
Vezora/Code-Preference-Pairs
Viewer
• Updated • 54k • 1.3k
• 32
Text Generation
• 71B • Updated • 2.4k
• • 199
Text Generation
• 8B • Updated • 446
• • 87
Viewer
• Updated • 270k • 44
• 7
OpenBuddy/openbuddy-llama3.1-8b-v22.2-131k
Text Generation
• 8B • Updated • 8
• • 2
Text Generation
• 3B • Updated • 207k
• • 655
Updated • 202
Text Generation
• 3B • Updated • 7.35k
• • 122
Viewer
• Updated • 11.2k • 31
• 7
argilla/magpie-ultra-v0.1
Viewer
• Updated • 50k • 718
• 222
mlabonne/Llama-3.1-70B-Instruct-lorablated-GGUF
71B • Updated • 436
• 47
Viewer
• Updated • 55.1k • 60
• 96
Text Generation
• 20B • Updated • 692
• 17
Viewer
• Updated • 1.02k • 90
• 13
Viewer
• Updated • 2.39M • 178
• 8
Viewer
• Updated • 6k • 554
• 201
Viewer
• Updated • 282 • 17
• 1
Gryphe/Sonnet3.5-Charcard-Roleplay
Viewer
• Updated • 9.74k • 171
• 88
NousResearch/hermes-function-calling-v1
Viewer
• Updated • 11.6k • 30.5k
• 417
AlgorithmicResearchGroup/ArXivDLInstruct
Viewer
• Updated • 778k • 159
• 15
upstage/solar-pro-preview-instruct
Text Generation
• 22B • Updated • 39.9k
• 457
mistral-community/pixtral-12b-240910
Image-Text-to-Text
• Updated • 4.5k
• 381
arcee-ai/Llama-3.1-SuperNova-Lite
Text Generation
• 8B • Updated • 1.39k
• • 198
Skywork/Skywork-Reward-Gemma-2-27B
Text Classification
• 27B • Updated • 37
• 49
Viewer
• Updated • 59.4k • 256
• 81
Viewer
• Updated • 29.9k • 680
• 77
argilla/FinePersonas-v0.1
Viewer
• Updated • 42.1M • 9.63k
• 409
Training Language Models to Self-Correct via Reinforcement Learning
Paper
• 2409.12917
• Published • 140
bespokelabs/Bespoke-MiniCheck-7B
Text Classification
• 8B • Updated • 8.62k
• 83
Viewer
• Updated • 13.6k • 149
• 20
mlabonne/open-perfectblend
Viewer
• Updated • 1.42M • 1.1k
• 73
rombodawg/Everything_Instruct
Viewer
• Updated • 4.05M • 53
• 54
Viewer
• Updated • 290k • 357
• 43
Viewer
• Updated • 2.2M • 18.6k
• 415
argilla/magpie-ultra-v1.0
Viewer
• Updated • 3.22M • 2.58k
• 51
Viewer
• Updated • 80k • 492
• 15
Viewer
• Updated • 31.4k • 342
• 23
Viewer
• Updated • 4.5k • 2.4k
• 38
CohereLabs/include-lite-44
Viewer
• Updated • 10.8k • 1.74k
• 16
Viewer
• Updated • 20.1k • 41
• 21
Gryphe/ChatGPT-4o-Writing-Prompts
Viewer
• Updated • 3.74k • 505
• 30
Updated • 33
• 9
Viewer
• Updated • 5.12k • 35
• 8
openerotica/pippa_scored2sharegpt
Viewer
• Updated • 1.96k • 25
• 2
openerotica/erotica-analysis
Viewer
• Updated • 15k • 115
• 33
iamketan25/roleplay-instructions-dataset
Viewer
• Updated • 3.15k • 105
• 31
AlekseyKorshuk/roleplay-characters
Viewer
• Updated • 784 • 80
• 25
Viewer
• Updated • 1.92k • 17
• 4
AlekseyKorshuk/erotic-books
Viewer
• Updated • 646 • 110
• 28
huihui-ai/Llama-3.3-70B-Instruct-abliterated
Text Generation
• 71B • Updated • 4.01k
• • 75
practical-dreamer/RPGPT_PublicDomain-alpaca
Viewer
• Updated • 4.26k • 92
• 33
lemonilia/Roleplay-Forums_2023-04
Updated • 676
• 15
Updated • 8
• 10
Updated • 87
• 112
QuasarResearch/apollo-preview-v0.4
Viewer
• Updated • 27.1k • 79
• 7
QuasarResearch/Quasar-CW-1k-v0.1
Viewer
• Updated • 1.05k • 9
• 3
Viewer
• Updated • 13.3k • 2.07k
• 47
microsoft/orca-agentinstruct-1M-v1
Viewer
• Updated • 1.05M • 3.33k
• 465
Sao10K/Llama-3.1-8B-Stheno-v3.4
8B • Updated • 628
• 90
Heralax/RPToolkit-demo-dataset
Viewer
• Updated • 2.7k • 27
• 15
Heralax/Mannerstral-dataset
Viewer
• Updated • 5.92k • 25
• 2
EVA-UNIT-01/EVA-LLaMA-3.33-70B-v0.1
Text Generation
• 71B • Updated • 19
• • 20
Sao10K/14B-Qwen2.5-Kunou-v1
Text Generation
• 15B • Updated • 224
• • 35
EVA-UNIT-01/EVA-Qwen2.5-32B-v0.2
Text Generation
• 33B • Updated • 225
• • 59
Epiculous/SynthRP-Gens-v1.1-Filtered-n-Cleaned
Viewer
• Updated • 2.74k • 92
• 15
allura-org/fujin-cleaned-stage-2
Viewer
• Updated • 11.8k • 8
• 2
allura-org/r_shortstories_24k
Viewer
• Updated • 23.7k • 21
• 6
allura-org/sugarquill-10k
Viewer
• Updated • 9.88k • 14
• 3
nothingiisreal/Reddit-Dirty-And-WritingPrompts
Viewer
• Updated • 393k • 142
• 65
nothingiisreal/Kalomaze-Opus-Instruct-25k-filtered
Viewer
• Updated • 48.7k • 62
• 2
nothingiisreal/DirtyWritingPrompts
Viewer
• Updated • 11.3k • 10
• 9
nothingiisreal/Human_Stories
Viewer
• Updated • 3.02k • 12
• 7
nothingiisreal/open-gpt-3.5-detector
Text Classification
• 67M • Updated • 9
• 4
lemonilia/Elliquiy-Role-Playing-Forums_2023-04
Viewer
• Updated • 112k • 83
• 10
Viewer
• Updated • 3.4k • 6.11k
• 59
amphora/QwQ-LongCoT-130K-2
Viewer
• Updated • 138k • 44
• 28
nebius/SWE-agent-trajectories
Viewer
• Updated • 80k • 3.05k
• 84
Sao10K/32B-Qwen2.5-Kunou-v1
Text Generation
• 33B • Updated • 16
• • 40
Viewer
• Updated • 6.87k • 45
• 39
jihyoung/ConversationChronicles
Viewer
• Updated • 200k • 513
• 11
Viewer
• Updated • 6.47M • 689
• 3
OpenLeecher/lmsys_chat_1m_clean
Viewer
• Updated • 273k • 477
• 85
HumanLLMs/Human-Like-DPO-Dataset
Viewer
• Updated • 10.9k • 719
• 257
Enhancing Human-Like Responses in Large Language Models
Paper
• 2501.05032
• Published • 62
nvidia/CantTalkAboutThis-Topic-Control-Dataset-NC
Viewer
• Updated • 1.19k • 92
• 7
NovaSky-AI/Sky-T1_data_17k
Viewer
• Updated • 16.4k • 409
• 186
Magpie-Align/Magpie-Reasoning-V2-250K-CoT-Deepseek-R1-Llama-70B
Viewer
• Updated • 250k • 388
• 106
bespokelabs/open-thoughts-code-annotations
Viewer
• Updated • 4 • 15
• 2
open-thoughts/OpenThoughts-114k
Viewer
• Updated • 228k • 99.9k
• 860
Viewer
• Updated • 1k • 3.13k
• 240
ByteDance-Seed/mga-fineweb-edu
Viewer
• Updated • 846M • 655
• 36
Viewer
• Updated • 817 • 6.64k
• 177
Viewer
• Updated • 4.59k • 1.2k
• 11
harpreetsahota/llama3_1-405B-on-IFEval
Viewer
• Updated • 541 • 86
• 4
HuggingFaceTB/everyday-conversations-llama3.1-2k
Viewer
• Updated • 2.38k • 1.56k
• 131
declare-lab/AlgoPuzzleVQA
Viewer
• Updated • 1.8k • 172
• 9
allenai/OLMo-2-0325-32B-Instruct
Text Generation
• 32B • Updated • 6.07k
• 148
Viewer
• Updated • 3.2k • 142
• 3
Viewer
• Updated • 200 • 382
• 3
Viewer
• Updated • 1.84M • 520
• 53
Text-to-Speech
• Updated • 3.2k
• 655
Viewer
• Updated • 23.3k • 17k
• 51