LightOnOCR-1B: The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR By lightonai and 2 others • 8 days ago • 55
Cosmos Predict 2.5 & Transfer 2.5: Evolving the World Foundation Models for Physical AI By nvidia • 3 days ago • 15
NVIDIA Releases 8 Million Sample Open Dataset and Tooling for OCR, Image Reasoning, Image and Video QA Tasks By nvidia and 6 others • 3 days ago • 14
Can Your LLM Think Like a Professional? Introducing ProfBench By nvidia and 7 others • 3 days ago • 14
How to Build a Healthcare Robot from Simulation to Deployment with NVIDIA Isaac for Healthcare By nvidia • 3 days ago • 12
Advancing Predictive ADMET Modeling Through Community-Driven Science: The ExpansionRx-OpenADMET Blind Challenge By hugging-science and 1 other • 4 days ago • 9
LightOnOCR-1B: The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR By lightonai and 2 others • 8 days ago • 55
Cosmos Predict 2.5 & Transfer 2.5: Evolving the World Foundation Models for Physical AI By nvidia • 3 days ago • 15
NVIDIA Releases 8 Million Sample Open Dataset and Tooling for OCR, Image Reasoning, Image and Video QA Tasks By nvidia and 6 others • 3 days ago • 14
Can Your LLM Think Like a Professional? Introducing ProfBench By nvidia and 7 others • 3 days ago • 14
How to Build a Healthcare Robot from Simulation to Deployment with NVIDIA Isaac for Healthcare By nvidia • 3 days ago • 12
Advancing Predictive ADMET Modeling Through Community-Driven Science: The ExpansionRx-OpenADMET Blind Challenge By hugging-science and 1 other • 4 days ago • 9