khanhld
/

chunkformer-gender-emotion-dialect-age-classification

Audio Classification

speech-classification

speech-processing

Model card Files Files and versions

khanhld commited on 21 days ago

Commit

3b67806

·

verified ·

1 Parent(s): 1fbe913

Upload ChunkFormer Classification Model

Files changed (1) hide show

README.md +17 -10

README.md CHANGED Viewed

@@ -57,19 +57,27 @@ result = model.classify_audio(
     audio_path="path/to/your/audio.wav",
     chunk_size=-1,  # -1 for full attention
     left_context_size=-1,
-    right_context_size=-1,
-    return_probabilities=True
 )
 print(result)
 # Output example:
 # {
-#   'gender': 0,
-#   'gender_probability': [0.95, 0.05],
-#   'dialect': 3,
-#   'dialect_probability': [0.1, 0.15, 0.05, 0.7],
-#   'emotion': 5,
-#   'emotion_probability': [0.05, 0.02, 0.03, 0.08, 0.02, 0.8]
 # }
 ```
@@ -78,8 +86,7 @@ print(result)
 ```bash
 chunkformer-decode \
     --model_checkpoint khanhld/chunkformer-gender-emotion-dialect-age-classification \
-    --audio_file path/to/audio.wav \
-    --return_probabilities
 ```
 ## Training

     audio_path="path/to/your/audio.wav",
     chunk_size=-1,  # -1 for full attention
     left_context_size=-1,
+    right_context_size=-1
 )
 print(result)
 # Output example:
 # {
+#   'gender': {
+#       'label': 'female',
+#       'label_id': 0,
+#       'prob': 0.95
+#   },
+#   'dialect': {
+#       'label': 'northern dialect',
+#       'label_id': 3,
+#       'prob': 0.70
+#   },
+#   'emotion': {
+#       'label': 'neutral',
+#       'label_id': 5,
+#       'prob': 0.80
+#   }
 # }
 ```
 ```bash
 chunkformer-decode \
     --model_checkpoint khanhld/chunkformer-gender-emotion-dialect-age-classification \
+    --audio_file path/to/audio.wav
 ```
 ## Training