[LW24] Megaparse

Name: [LW24] Megaparse
Rating: 5.0 (1 reviews)

Open-source File Parser optimized for LLM ingestion

5.0•1 review•

222 followers

Open-source File Parser optimized for LLM ingestion

5.0•1 review•

222 followers

Visit website

Note and writing apps

•

LLMs

File Parser optimized for LLM Ingestion. Parse PDFs, DOCX, PPTX in a format that is ideal for LLMs. All of that accessible from a python package, an API, or a queue.

Free

Launch tags:

Developer Tools•GitHub

Launch Team / Built With

Stan Girard

Quivr - Your Second Brain

Maker

📌

Hi everyone, Today I’d like to introduce you to the new Quivr project. It a simple python package, API that helps you take in documents such as PDFs, Docx, PPTx, ... and turn them into Markown It has several new abilities: * OCR * Vision Models * Table Optimization in the extraction * Open-source You can use it in any of your products where you need to parse file to then send them to an LLM or simply store it Here is how to get started: * Go to https://github.com/QuivrHQ/MegaP... * pip install megaparse * Have fun Give it a try! We’d love to hear your feedback and ideas in the comments. This is part of Supabase mega Launch Week -> https://launchweek.dev/HOME

Report

9mo ago

Sacha Dumay

AIThumbnail.so

@stan_girard great tool !!!!

Report

9mo ago

Damien Henry

ClipDrop

@stan_girard whoo! This is awesome!!! I'll try it in my next project

Report

9mo ago

Christophe Pasquier

Super

Everyone that went through the pain of parsing slides and pdf know how big a problem that solves ;) GG team!

Report

9mo ago

Stan Girard

Quivr - Your Second Brain

Maker

@christophepas Thanks mate! Let me know if you are using it and I'll gladly help you improve it

Report

9mo ago

Michael Ohana

Awesome ! How does it tackle tables in financial documents?

Report

9mo ago

Stan Girard

Quivr - Your Second Brain

Maker

@michaelohana This is a hard piece to tackle, we are currently working hard on improving tables. We are exploring some techniques. For example we are looking at combining LLM Vision models with current OCR. Passing the table to a dataframe. Would love to tell you more or help you with your use case. Ping me if need on twitter @_StanGirard

Report

9mo ago

AssemblyAI — Speech-to-Text API with diarization

Speech-to-Text API with diarization

Promoted

[LW24] Megaparse

Open-source File Parser optimized for LLM ingestion

Open-source File Parser optimized for LLM ingestion

Do you use [LW24] Megaparse?

Engineering & Development

AI

Work & Productivity

Marketing & Sales

Design & Creative

Social & Community

Finance

Product add-ons

Trending categories

Top reviewed

Trending products

Top forum threads

Engineering & Development

AI

Work & Productivity

Marketing & Sales

Design & Creative

Social & Community

Finance

Product add-ons

Trending categories

Top reviewed

Trending products

Top forum threads

Do you use [LW24] Megaparse?