

Sean Tseng

seantyh


AI & ML interests

Computational linguistics, psycholinguistics, NLP, lexical semantics, lexical resources


seantyh's collections (5)

Language Modeling Is Compression

Paper • 2309.10668 • Published Sep 19, 2023 • 83
Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference Using Sorted Fine-Tuning (SoFT)

Paper • 2309.08968 • Published Sep 16, 2023 • 23
Vision Transformers Need Registers

Paper • 2309.16588 • Published Sep 28, 2023 • 83
Localizing and Editing Knowledge in Text-to-Image Generative Models

Paper • 2310.13730 • Published Oct 20, 2023 • 7

Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions

Paper • 2309.10150 • Published Sep 18, 2023 • 25

Chain-of-Verification Reduces Hallucination in Large Language Models

Paper • 2309.11495 • Published Sep 20, 2023 • 39

CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages

Paper • 2309.09400 • Published Sep 17, 2023 • 85
Multilingual E5 Text Embeddings: A Technical Report

Paper • 2402.05672 • Published Feb 8, 2024 • 22

Multimodal Foundation Models: From Specialists to General-Purpose Assistants

Paper • 2309.10020 • Published Sep 18, 2023 • 40
Kosmos-2.5: A Multimodal Literate Model

Paper • 2309.11419 • Published Sep 20, 2023 • 55
AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model

Paper • 2309.16058 • Published Sep 27, 2023 • 56
Jointly Training Large Autoregressive Multimodal Models

Paper • 2309.15564 • Published Sep 27, 2023 • 8

