Blog | DevWorld.AI

Comparing OCR data extraction tools - EasyOCR, Tesseract-OCR, and AWS Textract

July 20, 2023 · 6 min read

Parsing printed text data is easy, but parsing data from PDFs, scans, and images is hard because they come in many formats and varying quality. There may also be handwriting involved. To solve this challenge, we need to rely on computer vision and optical character recognition (OCR). Let's explore the most popular OCR text extraction tools and compare them.

Document search tool with Node, OpenAI, Langchain, and Pinecone

July 8, 2023 · 9 min read

Ever wanted to look up any information from a document and ask it questions like you would with ChatGPT? With the power of Langchain, it's not only doable but fairly straightforward. I will show you how to get this up and running in 20 minutes.

PDF data search with Python, OpenAI, Langchain, Faiss, and Streamlit

June 10, 2023 · 8 min read

In the previous tutorial, we learned how to make a document search tool using Node.js and Pinecone that lets us search private data like we would with ChatGPT. In this tutorial, we will learn how to do this with Python and Faiss. We will also use Streamlit to quickly spin up a simple user interface where we can ask questions and get answers dynamically.

Run LLM from HuggingFace directly on your machine

May 3, 2023 · 2 min read

We will learn how to run a model from HuggingFace Hub locally on our machine and ask it questions just like we would with ChatGPT. If you are worried about privacy issues with models hosted by someone else, this is a good place to start! Hosting your own model on your machine will give you full control.

Run LLM directly on Hugging Face Hub

March 14, 2023 · 2 min read

Today we will learn how to connect to a model hosted on HuggingFace Hub and ask it questions just like we would with ChatGPT. It's really simple and will only take 10 minutes.