Collections
Collections including paper arxiv:2104.08211 
						
					
- A Biomedical Entity Extraction Pipeline for Oncology Health Records in Portuguese
  Paper • 2304.08999 • Published • 3
- CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages
  Paper • 2309.09400 • Published • 85
- Robust Open-Vocabulary Translation from Visual Text Representations
  Paper • 2104.08211 • Published • 1
- Poro 34B and the Blessing of Multilinguality
  Paper • 2404.01856 • Published • 15