dung-vpt-uney committed
Commit 83428d7 · 1 parent: 68aa06c
Update Visual-CoT demo - 2025-10-12 23:10:09
Fixes:
- Fix LLaVA config registration error (compatibility with newer transformers)
- Update Gradio to latest version (security fixes)
- Auto-deployed via update script
llava/model/language_model/modeling_llamantk.py CHANGED
@@ -144,6 +144,7 @@ def call_flash_attn_qkvpacked(qkv, cu_seqlens, max_seqlen, dropout_p=0.0, softma
 cu_seqlens_k=cu_seqlens,
 max_seqlen_q=max_seqlen,
 max_seqlen_k=max_seqlen,
+is_causal=causal,  # Add causal mask parameter
 )[0]
 return output
 elif HAS_FLASH_ATTN:
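The added argument forwards a causal flag to the variable-length attention call. As a rough illustration of what a causal mask means for packed input described by `cu_seqlens`, here is a minimal pure-Python sketch. It is not the kernel's implementation, and `packed_causal_mask` is a hypothetical helper that does not appear in this commit; it only shows which token pairs are allowed to attend to each other.

```python
def packed_causal_mask(cu_seqlens, causal=True):
    """Return allowed[i][j]: may packed token i attend to packed token j?

    Hypothetical helper for illustration only.
    cu_seqlens holds cumulative sequence boundaries, e.g. [0, 3, 5] means
    two sequences of lengths 3 and 2 packed into one buffer of 5 tokens.
    """
    total = cu_seqlens[-1]

    # Map every packed token index to the id of the sequence it belongs to.
    seq_id = [0] * total
    for s, (start, end) in enumerate(zip(cu_seqlens[:-1], cu_seqlens[1:])):
        for t in range(start, end):
            seq_id[t] = s

    allowed = []
    for i in range(total):
        row = []
        for j in range(total):
            # Tokens never attend across sequence boundaries.
            same_seq = seq_id[i] == seq_id[j]
            # With causal=True a token sees only itself and earlier tokens.
            not_future = (j <= i) or not causal
            row.append(same_seq and not_future)
        allowed.append(row)
    return allowed


if __name__ == "__main__":
    for row in packed_causal_mask([0, 3, 5]):
        print("".join("x" if ok else "." for ok in row))
```

With `causal=False` each token can attend to every token in its own sequence, which is why omitting the flag in an autoregressive model would let positions see future tokens.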