A Case Study of Web App Coding with OpenAI Reasoning Models Paper • 2409.13773 • Published Sep 19, 2024 • 7
Insights from Benchmarking Frontier Language Models on Web App Code Generation Paper • 2409.05177 • Published Sep 8, 2024 • 8
WebApp1K: A Practical Code-Generation Benchmark for Web App Development Paper • 2408.00019 • Published Jul 30, 2024 • 2
Riot Gremlins 👹🙀 Collection 7B Models For merging. for RP on my TINY rig at Q6. Without a bloody POD. Merge ideas/sketches. Results will go in 'Babsies Models.' • 38 items • Updated about 1 hour ago • 1
Babsie's Models Collection Finished models not in process of being built or repaired. If a model disappears from this list, I'm fixing a bug and it will return • 9 items • Updated 4 days ago • 2
view article Article AetherMind_SRL: Self-Reflective Learning for Robust Natural Language Inference 6 days ago • 1
view article Article The Pharmome Map: a comprehensive public dataset for drug-target interaction modeling 9 days ago • 11
RLCR Collection Collection of models and datasets for Beyond Binary Rewards: Training LMs to Reason about their Uncertainty • 10 items • Updated Aug 6 • 7
GAD-Models Collection Model checkpoints of Black-Box On-Policy Distillation of Large Language Models • 5 items • Updated 10 days ago • 5