Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
8.5
TFLOPS
12
13
135
Clelia Astra Bertelli
as-cle-bert
Follow
Sanshi444's profile picture
Illusionrealistic's profile picture
lakshay007's profile picture
2294 followers
ยท
40 following
https://www.clelia.dev
itsclelia
AstraBert
clelia-astra-bertelli-583904297
cle-does-things.bsky.social
AI & ML interests
Biology + Artificial Intelligence = โค๏ธ | AI for sustainable development, sustainable development for AI | Researching on Machine Learning Enhancement | I love automation for everyday things | Blogger | Open Source
Recent Activity
liked
a model
about 1 month ago
facebook/dinov2-small
posted
an
update
6 months ago
Let's pipe some ๐ฑ๐ฎ๐๐ฎ ๐ณ๐ฟ๐ผ๐บ ๐๐ต๐ฒ ๐๐ฒ๐ฏ into our vector database, shall we?๐ค With ๐ข๐ง๐ ๐๐ฌ๐ญ-๐๐ง๐ฒ๐ญ๐ก๐ข๐ง๐ ๐ฏ๐.๐.๐ (https://github.com/AstraBert/ingest-anything) you can now scrape content simply starting from URLs, extract the text from it, chunk it and put it into your favorite LlamaIndex-compatible database!๐ธ๏ธ You can do it thanks to ๐ฐ๐ฟ๐ฎ๐๐น๐ฒ๐ฒ by Apify, an open-source crawling library for python and javascript that handles all the data flow from the web: ingest-anything then combines it with ๐๐ฒ๐ฎ๐๐๐ถ๐ณ๐๐น๐ฆ๐ผ๐๐ฝ, ๐ฃ๐ฑ๐ณ๐๐๐๐ผ๐๐ป and ๐ฃ๐๐ ๐๐ฃ๐ฑ๐ณ to scrape HTML files, convert them to PDF and extract the text - hassle-free!๐ธ Check the attached code snippet if you're curious of knowing how to get started๐ฌ PS: Don't tell anybody, but this release also has another gem... It supports OpenAI models for agentic chunking, following the new releases of Chonkie๐ฆโจ If you don't want to miss out on the new features, leave us a little star on GitHub โก๏ธ https://github.com/AstraBert/ingest-anything And join our discord community! โก๏ธ https://discord.gg/kDqHNjks
posted
an
update
7 months ago
Hey there, ๐ถ๐ป๐ด๐ฒ๐๐-๐ฎ๐ป๐๐๐ต๐ถ๐ป๐ด ๐๐ญ.๐ฌ.๐ฌ just dropped with major changes: โ Embeddings: now works with Sentence Transformers, Jina AI, Cohere, OpenAI, and Model2Vec All powered via ๐๐ต๐ผ๐ป๐ธ๐ถ๐ฒโ๐ ๐๐๐๐ผ๐๐บ๐ฏ๐ฒ๐ฑ๐ฑ๐ถ๐ป๐ด๐. No more local-only limitations ๐ โ Vector DBs: now supports ๐ฎ๐น๐น ๐๐น๐ฎ๐บ๐ฎ๐๐ป๐ฑ๐ฒ๐ -๐ฐ๐ผ๐บ๐ฝ๐ฎ๐๐ถ๐ฏ๐น๐ฒ ๐ฏ๐ฎ๐ฐ๐ธ๐ฒ๐ป๐ฑ๐ Think: Qdrant, Pinecone, Weaviate, Milvus, etc. No more bottlenecks๐ โ File parsing: now plugs into any ๐๐น๐ฎ๐บ๐ฎ๐๐ป๐ฑ๐ฒ๐ -๐ฐ๐ผ๐บ๐ฝ๐ฎ๐๐ถ๐ฏ๐น๐ฒ ๐ฑ๐ฎ๐๐ฎ ๐น๐ผ๐ฎ๐ฑ๐ฒ๐ฟ Using LlamaParse, Docling or your own setup? Youโre covered. Curious of knowing more? Try it out! ๐ https://github.com/AstraBert/ingest-anything
View all activity
Organizations
as-cle-bert
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
published
an
article
10 months ago
view article
Article
Why we (don't) need export control
Feb 1
โข
9
published
an
article
10 months ago
view article
Article
Search the Web with AI
Jan 10
โข
5
published
an
article
11 months ago
view article
Article
Debate Championship for LLMs
Dec 30, 2024
โข
4
published
an
article
11 months ago
view article
Article
Building an AI-powered search engine from scratch
Dec 12, 2024
โข
11
published
an
article
about 1 year ago
view article
Article
streamlit_supabase_auth_ui
Nov 3, 2024
โข
4
published
an
article
about 1 year ago
view article
Article
AI is turning nuclear: a review
Oct 20, 2024
โข
10
published
an
article
over 1 year ago
view article
Article
Is AI carbon footprint worrisome?
Jul 14, 2024
โข
3
published
an
article
over 1 year ago
view article
Article
_Repetita iuvant_: how to improve AI code generation
Jul 8, 2024
โข
5
published
an
article
over 1 year ago
view article
Article
BrAIn: next generation neurons?
Jun 5, 2024
โข
15
published
an
article
over 1 year ago
view article
Article
What is going on with AlphaFold3?
May 21, 2024
โข
15