Docu-Chat

AI-Powered Query Engine for Document Insights

How it works:

Frequently Asked Questions

How does Docu-Chat work?

The simple answer:

Docu-Chat is an AI-powered query engine that helps you extract insights from PDF documents. To use it:

The app will analyze the document and respond with an answer. It's like having a conversation with your document.


The technical answer:

Docu-Chat leverages several technologies to provide an interactive chat interface for querying PDF documents.

What can I upload?

At the moment Docu-Chat is designed to analyze PDF files only. For best results, upload PDFs with machine readable text. Scanned documents and PDFs without machine readable characters will result in lower quality results. There's no limit to the size or length of file you upload, but very large files will require more time to process and more OpenAI tokens (see What is an "OpenAI API Key" and where do I get one?).

Please note that although this application has been designed with user privacy and data security in mind, it is not recommended to upload sensitive documents. See the full app disclaimer at the bottom of this page.

What is an "OpenAI API Key" and where do I get one?

Good question. Here's how do I get your API Key:

This is your unique key (password) which authorizes your use of OpenAI (the clever engine that powers Docu-Chat). You can get your unique key by doing the following:

Once you have a key, you'll need to enter it into this application before you can use it – that way, OpenAI knows who's making the request!

But what is an API?

An API (Application Programming Interface) is a set of rules that allow different software applications (like Docu-Chat and OpenAI) to communicate with each other. The API defines how requests and responses (like the questions you ask) should be formatted, and what data (like the documents you upload) can be accessed.

And what is an API Key?

An API Key is basically your unique password that allows you to access the specific services provided by the API. The key helps the API provider (in this case OpenAI) to identify you, track your usage, and control access to the data and functions. Essentially, the API Key ensures that only authorized users can interact with OpenAI.

Is this free to use?

Yes and no. Let me explain:

In simple terms, the app itself is totally free to use. I don't charge anything for the app and don't make any money from it (in fact, I pay to keep it running)! But this application uses OpenAI to process your requests. OpenAI charge a very small amount of money for this, and you'll need to provide an OpenAI API Key before using this application. OpenAI will charge you approximately $0.03 for every 15,000 words you upload, and $0.04 for every 15,000 words of analysis generated. Since the average length of book is 90,000 words, it's about 18 cent to analyze a whole book (...pretty cheap).


The more technical explanation:

Docu-Chat uses OpenAI's large language model (GPT-3.5-Turbo) to process your requests via API. OpenAI charge users to make requests via API, and that's why you need to provide your unique API Key. When you make an API request, OpenAI calculate a fee based on the number of tokens used. Tokens can be thought of as a number of characters. Before the API processes your request, your input is broken down into tokens. Here are some helpful rules of thumb for understanding tokens in terms of lengths: 1 token = ~4 characters in English. 100 tokens = ~75 words. 2 sentence = ~30 tokens. 1 paragraph = ~100 tokens. 1,500 words = ~2048 tokens.

This application currently uses OpenAI's GPT-3.5-Turbo model, which costs $0.0015 / 1K tokens (input) and $0.002 / 1K tokens (output). So if you upload 1,500 words (~2,048 tokens) you'll be charged less than 1/3 of a cent ($0.003). You can read more about OpenAI's model pricing here ⧉ and calculate the number of tokens in your text here ⧉. You can maintain control of your spending by reviewing your usage and setting spending limits via your OpenAI account. 

Is there a limit to the number of questions I can ask?

There's no strict limit on the number of questions you can ask. You can engage in an ongoing conversation with the AI to extract detailed insights from your uploaded document. To keep the query engine efficient, I have limited the "working memory" so if you ask multiple questions, the engine will "forget" the older questions you've asked (see How does Docu-Chat work?)

Can I rely on the answers Docu-Chat provides?

DocuChat uses advanced AI algorithms to analyze the content of your uploaded PDF and provide answers to your questions. While it's designed to be highly accurate, it's always recommended that you cross-reference critical information with the original source. The AI's interpretation is based on the content within the document, but it can make mistakes.

Why did you build this?

Great question.

I built Docu-Chat to empower users with instant access to insights hidden within long and/or complex documents. In today's information-rich world, finding specific answers within extensive documents can be time-consuming. Docu-Chat improves efficiency and streamlines information extraction.

Other apps might already do this, but they charge premium subscription fees and put restrictive limits on a users. For example, other application cap your number of uploads and number of queries. By making an open source free-to-use version, you don't pay any unnecessary costs, you aren't locked into a subscription, and you're very welcome to clone, remix, re-purpose, or upgrade this app yourself via Hugging Face ⧉

How is this application better than other document analysis tools?

That's an easy one.

Other apps charge subscription fees and restrict your usage (for example, they limit the number of uploads, your file sizes, and the number of queries you can make). They also lock you into subscriptions, so you pay for a whole month or year of usage, even if you only analyze one file. A number of these apps are also using older AI models, resulting in lower quality answers than Docu-Chat can provide.

This application free to use and open source. You just pay for your OpenAI usage (which you're in control of). No subscriptions, no usage fees, no service costs, and no user limits. And because it's open source, you're welcome to clone, remix, re-purpose, or upgrade this app however you wish! Full code is available via Hugging Face ⧉

Feel free to GET IN TOUCH if you're interested in learning more about this application, or using generative AI.

Like Docu-Chat? Help me keep it running by buying me a coffee.

App disclaimer: This web application is currently in beta and may producer errors. Although this application has been designed with data privacy in mind, the application does not provide any guarantee. Users are recommended to avoid uploading sensitive documents or information. Answers are generated using OpenAI's large language model, and outputs may contain inaccuracies, fictional content, or potentially harmful information. Submitted information is not stored by OpenAI. For more, see OpenAI's data usage policies here. By using this tool, you acknowledge that it is provided 'as is' and without any express or implied warranties. I do not assume responsibility for any outputs. Use at your own risk and discretion. If you receive an error, would like to provide feedback, or have suggestions, please contact McKenzie.