ManishThota/CustomModel

Text Generation

text-generation-inference

ManishThota committed on Feb 12, 2024

Commit 48d243e · verified · 1 Parent(s): 11b3e83

Update README.md

Files changed (1)

README.md +2 -1

@@ -4,13 +4,14 @@ license: creativeml-openrail-m
 ---
 <h1 align='center' style='font-size: 36px; font-weight: bold;'>Sparrow</h1>
 <h3 align='center' style='font-size: 24px;'>Tiny Vision Language Model</h3>
-<h4 align='center', style='font-size: 18px;' >A Custom Model Enhanced for Educational Contexts: This specialized model integrates slide-text pairs from machine learning classes, leveraging a unique training approach. It connects a frozen pre-trained vision encoder (SigLip) with a frozen language model (Phi-2) through an innovative projector. The model employs attention mechanisms and language modeling loss to deeply understand and generate educational content, specifically tailored to the context of machine learning education. </h4>
 <p align="center">
   <img src="https://cdn-uploads.huggingface.co/production/uploads/650c7fbb8ffe1f53bdbe1aec/DTjDSq2yG-5Cqnk6giPFq.jpeg" width="50%" height="auto"/>
 </p>
+<h4 align='center', style='font-size: 16px;' >A Custom Model Enhanced for Educational Contexts: This specialized model integrates slide-text pairs from machine learning classes, leveraging a unique training approach. It connects a frozen pre-trained vision encoder (SigLip) with a frozen language model (Phi-2) through an innovative projector. The model employs attention mechanisms and language modeling loss to deeply understand and generate educational content, specifically tailored to the context of machine learning education. </h4>
 <p align='center' style='font-size: 16px;'>
 3B parameter model built by <a href="https://www.linkedin.com/in/manishkumarthota/">@Manish</a> using SigLIP, Phi-2, Language Modeling Loss, LLaVa data, and Custom setting training dataset.
 The model is released for research purposes only, commercial use is not allowed.