DeepSeek-V2.5?
#6 opened about 1 year ago by goodasdgood

Repetitive generation without additional EOS token
#5 opened over 1 year ago by amrothemich (2 comments)

Biomed Foundation Model
#4 opened over 1 year ago by amrothemich (1 comment)

Yi-34B AQLM?
#3 opened over 1 year ago by llama-anon

~8 tok/sec with ~5k context on vLLM with Flash Attention and `kv_cache_dtype="fp8"` on 3090TI 24GB VRAM
#2 opened over 1 year ago by ubergarm (2 comments)