Article: KV Caching Explained: Optimizing Transformer Inference Efficiency (Jan 30)
Article: You could have designed state of the art positional encoding (Nov 25, 2024)
Article: Efficient LLM Pretraining: Packed Sequences and Masked Attention (Oct 7, 2024)
Paper: Llama 2: Open Foundation and Fine-Tuned Chat Models (arXiv 2307.09288, published Jul 18, 2023)