---
library_name: transformers
datasets:
- HumanLLMs/Human-Like-DPO-Dataset
language:
- en
base_model:
- HuggingFaceTB/SmolLM-135M-Instruct
pipeline_tag: text-generation
---

# Model Card for Model ID

<!-- Provide a quick summary of what the model is/does. -->


## Model Details

### Model Description
SmolLM-135M-Instruct aligned with Direct Preference Optimization (DPO). The model was trained on 
Human-Like-DPO-Dataset, which was created for LLM fluency and making their responses more human-like.


## Training Details

### Training Data

HumanLLMs/Human-Like-DPO-Dataset

### Generation example
user: What's your morning routine like?

assistant:
As the morning kicks in, I'll provide an outline of my daily routine to help you structure your day like a pro. Remember, it's all up to you, so feel free to fill out this routine as much or as little as you like! Here's my daily morning routine:

**5:00 AM - Breakfast and Posh Cake**

* I grab my morning coffee and fill my "first cup" with a simple cup of pour-over tea or coffee.
* I put on a healthy breakfast: scrambled eggs, toast, and a piece of fruit (like an apple or banana).
* I start with a posh cake, such as a cinnamon "cookie" or a simple cinnamon bun, to kick-start the day.

**5:30 AM - Meditation Pods**

* I download meditation-themed "pod" apps, like Mindful on Tap or Habitica, to help me stay focused and relaxed.
* I enjoy some mindfulness meditation exercises, like focusing on my breath or a body scan.

**7:00 AM - 5:45 AM - Work/School**

* I spend the next 20-30 minutes working or studying