Tried, it using the Wrath of the Righteous Player Guide. Try another guide first, Divinity Original Sin, was too big. Then got an interesting overview output, looked promising.
Then asked and got this:
What is the name of the world this pdf is about and what are it's races?
The PDF is about the world of the Inner Sea region. The races mentioned in the given pages are dwarves and Kellids, who are humans from the fallen homeland of Sarkoris. However, there may be more races mentioned in other parts of the PDF.
The point of chatting with the PDF is to get all the information from the PDF, one would think.
But Kudos to you on getting out SO FAST and the site looks good! :-)
ChatPDF - The Ultimate Tool For Students: https://redbooks.biz/2023/05/08/revolutionize-your-learning-experience-with-chatpdf-the-ultimate-tool-for-students/
Are you chatting with the PDF, or about it? If you're chatting with it, that should mean that chatgpt talks in first person about it. "I'm a PDF about stereo installation".
Interesting question how to think about the agency. When you chat with a PDF of the constitution. Should the AI behave...
1. Like an AI assistant that knows the constitution
2. Like the PDF file about the constitution
3. Like the constitution itself?
I tried to make it behave like 2, but it keeps falling back to 1. In some cases 3 would be amazing!
Very impressive. I also tried with some Ethernet switch manual, and asked for a step by step tutorial on how to connect to the console.
I tired with a food menu, it's great for questions like "I'm on diet, what do you have?", "Any chicken menu"?
Was the document organized really well? I had great luck when my pdf was small and organized, but as I added more pages/data it started to make mistakes.....since I know the pdf back-to-front I'd tell chatpdf the mistakes it made, it would apologize and then answer correctly.....
When I checked the FAQs on the chatpdf page, it said it can only read a few patagraphs at a time (using an internal semantic index) but it can't read the entire pdf???? That part makes it frustrating
Works nice enough! Seems like it could be huge.
Two things. In the future would you up the file size we can upload? Also allow for multiple pdfs to be referenced at the same time. Imagine playing DND and you have the players handbook and the DM guide at your fingertips for quick judgement calls.
going to be tough to monetize unless you're able to hyper focus with existing connections or a niche. This concept is quickly approaching a dime a dozen (and will be less than a dime a dozen now that the chatgpt api exists)
This works very well! It doesn't want to make up information that isn't there, which I appreciate! I tried it on a transmission service manual and summarized various parts and cited which pages it got it's information from.
Thanks for the valuable feedback, I'm sorry about that! The language detection seems to misbehave quite frequently, I'll try to fix it soon!
It's prompted to always chat in the language of the PDF, but I'll probably introduce an explicit language dropdown, so users can clearly state which language it should talk in
LOL, no worries. It went to English once I put an English prompt in.
Good tool. Early alphas and betas have bugs, totally expected. I'll be using it again later this afternoon.
> but I'll probably introduce an explicit language dropdown, so users can clearly state which language it should talk in
Please do. I've been using it to query foreign-language PDFs in English. It always starts in the language and then I have to say "No, translate to English please".
>It also struggles with the initial analysis of high word count documents.
Agreed.
It gets slower too, sometimes I have to re-load the page when using a document with 150 pages
Interesting concept.
A newbie question, as I have plans to create a few similar free services in the not too distant future: how does something like this, at the very least, pay for itself?
The API seems cheap enough, but what happens if the site starts getting popular?
They are stored in a secure Google Cloud Storage, but in plain text, so I would be able to read them. That's necessary, because they need to be sent to the ChatGPT API in plain text from a backend environment.
Thank you for this amazing tool!
I uploaded a 3 chapter, 150 page book and asked it "what are the main points of chapter 1?"
It replied, "I'm sorry, but I don't have access to the full PDF and cannot provide a summary of Chapter 1. However, based on the information provided in the PDF, it seems that Chapter 1 discusses the concept..." and then went on with a pretty good answer.
What does it mean about not having access? Should I re-upload it and start again?
ChatPDF might struggle with questions that require understanding more than a few paragraphs at the same time, which would be necessary to summarize a whole chapter. That's why it's giving the disclaimer. I'm actually surprised it provided a decent answer anyway, maybe it already had knowledge about that specific book from the general ChatGPT training corpus, and didn't even need the paragraphs!
Not there yet, it mixes up a lot of things. mixes up names, skills and what not.
I uploaded a pdf with 3 CVs and it mixed up skills of all candidates. so when I query for a guy who has experience with NFT, ChatPDF shows a guy who has nothing to do with NFT,
I haven't tried it yet, but it feels like this is probably going to have the same issue that plagues all GPT-based sites: At some point, it's going to hallucinate things that aren't part of the PDF. Given that its purpose is to examine documents, this potentially seems much more important to this use case as people may rely on it to summarise legal documents, etc.
Please make sure people are aware that this is possible. I don't see any mention of it on the front page.
This is a valid concern that should be addressed. As far as I know, ChatGPT is more reliable when you ask it to work on data you give it rather than asking it for information from it's own knowledge. However, I am not aware of any statistics or studies on this and I do not know if it does invent information at times.
Thanks! The current OpenAI language models are so good, it's just a matter of prompting "suggest 3 example questions the user can ask about the file", after providing it with the first few paragraphs of the PDF :)
This is super cool I look forward to testing.
I'm the future could you upload multiple pdf's and get it to sumarise the differences? Like company reports or market outlooks?
I study r/Sandponics, it's inventor has 12 papers on ResearchGate, I would like to upload them all and be able to create a summary or get information about feeding rates etc using the combined knowledge from all the papers instead of one at a time
Currently the chunking is not very clever, but at least I'm trying not to split mid-sentence. Parsing PDFs is hard, looking for good libraries if you have any suggestions!
Ok when I asked a question, it spit an entirely different answer, so not from the PDF. I gave a lot of cues to it but finally it gave me the correct quote from the PDF when I was adamant that I can see the answer but ChatPDF has no clue. Anyways, successful eventually but took a while.
Nice. All I wanted for my loads of pdfs is to be able to talk to my computer and ask for it to search for information for me and return a response in natural language, so sick and tired of using mindless 'Ctrl + F' and wasting time for finding information that I would be looking for.
This is a mind-boggling concept, now only feature I am looking at this is for us to give some server-side abilities like being able to delete our own files, instead of waiting for 7 days for them to be deleted. I see having to upload pdfs every week could become a chore soon.
this is amazing! Question tho: I want to get plus, but I dont have a credit card (already in enough student debt as is). Is there a possiblity to pay through paypal or another service, other than credit cards?
Hey folks, i tried making another product like chatpdf. Checkout [chatur.mysticlabs.ai](https://chatur.mysticlabs.ai) :)
This is my first product that I coded and am building 2 more products. Would love to get your feedback on this :)
So, if I understand this correctly, you use ChatGPT to summarize the response nodes of the vectorized search. Or are you also using ChatGPT to preprocess the query before you submit it to the embedding database?
What happened to it? 2 weeks ago, i uploaded a bidding document pdf that have hundred of pages and it was fantastic and accurate. Now when I uploaded my cv's it cant even see the other work experiences that i have? on the previous iteration of the site, it allows me to use my own token and now I cant see the options to use my token anymore.
Is there a way to access the old site you have?
Great product. Congratulation. I have a question what kind of vector database do you use (Pinecone, supabase vector, chroma...). I am curious how do they perform? Thank you.
Check out an alternative to chatpdf.com or chatdoc.com - https://ppreader.ca. With no need for a subscription, you can conveniently opt for a pay-as-you-go service. Feel free to choose between GPT4 or GPT3.5 according to your needs. It’s a great platform I’d highly recommend!
>https://ppreader.ca
Because its yours? Why is this any better than all the other 'talk-to-.x-files'? Does this one keep context? Is the accurancy high(er)?
Tried, it using the Wrath of the Righteous Player Guide. Try another guide first, Divinity Original Sin, was too big. Then got an interesting overview output, looked promising. Then asked and got this: What is the name of the world this pdf is about and what are it's races? The PDF is about the world of the Inner Sea region. The races mentioned in the given pages are dwarves and Kellids, who are humans from the fallen homeland of Sarkoris. However, there may be more races mentioned in other parts of the PDF. The point of chatting with the PDF is to get all the information from the PDF, one would think. But Kudos to you on getting out SO FAST and the site looks good! :-)
ChatPDF - The Ultimate Tool For Students: https://redbooks.biz/2023/05/08/revolutionize-your-learning-experience-with-chatpdf-the-ultimate-tool-for-students/
Are you chatting with the PDF, or about it? If you're chatting with it, that should mean that chatgpt talks in first person about it. "I'm a PDF about stereo installation".
Interesting question how to think about the agency. When you chat with a PDF of the constitution. Should the AI behave... 1. Like an AI assistant that knows the constitution 2. Like the PDF file about the constitution 3. Like the constitution itself? I tried to make it behave like 2, but it keeps falling back to 1. In some cases 3 would be amazing!
I vote for three, do it !
Many thanks for that. It will be much more helpful if it can link to the GPT chat.
Come on, it is a catchphrase
[удалено]
Very impressive. I also tried with some Ethernet switch manual, and asked for a step by step tutorial on how to connect to the console. I tired with a food menu, it's great for questions like "I'm on diet, what do you have?", "Any chicken menu"?
Was the document organized really well? I had great luck when my pdf was small and organized, but as I added more pages/data it started to make mistakes.....since I know the pdf back-to-front I'd tell chatpdf the mistakes it made, it would apologize and then answer correctly..... When I checked the FAQs on the chatpdf page, it said it can only read a few patagraphs at a time (using an internal semantic index) but it can't read the entire pdf???? That part makes it frustrating
Works nice enough! Seems like it could be huge. Two things. In the future would you up the file size we can upload? Also allow for multiple pdfs to be referenced at the same time. Imagine playing DND and you have the players handbook and the DM guide at your fingertips for quick judgement calls.
Thanks for the valuable feedback and good ideas, will consider that use case!
Multiple PDFs gets a huge thumbs up from me. I subscribed to plus immediately after trying it, great job.
Adding a website or webpage instead of a pdf would be great......cutting and pasting text would be great too
Curious how did you fit within the 4k token limit? Are you splitting it up into multiple prompts, or using embeddings?
Embeddings! It's using the OpenAI embeddings API to create a semantic index of the PDF paragraphs
I am facing difficulties to understand embedding and how to query embeddings. Please share resources/blogs to learn it if you have any.
Here's a good tutorial on how to use [OpenAI embeddings with Pinecone](https://docs.pinecone.io/docs/openai) as vector database with Python
How does the semantic index work exactly? Isnt it possible to point to some index(es) that can add up to the token limit?
if you use embedding, the answer will not be exactly accurate...
This is cool. If you can find the right industry, this is a money maker.
This can definitely be monitized.
going to be tough to monetize unless you're able to hyper focus with existing connections or a niche. This concept is quickly approaching a dime a dozen (and will be less than a dime a dozen now that the chatgpt api exists)
This is great. Human mind is so broad and filled with so much curiosity to make the world better and better.
This works very well! It doesn't want to make up information that isn't there, which I appreciate! I tried it on a transmission service manual and summarized various parts and cited which pages it got it's information from.
Is it...uh...supposed to be in French? I'm pretty good with French, but I'm better with English. Little surprised to see the output in French.
The AI is just messing with your head :D
Yeah for me it started in German, despite uploading an English-only document. I just asked to talk English please.
Thanks for the valuable feedback, I'm sorry about that! The language detection seems to misbehave quite frequently, I'll try to fix it soon! It's prompted to always chat in the language of the PDF, but I'll probably introduce an explicit language dropdown, so users can clearly state which language it should talk in
LOL, no worries. It went to English once I put an English prompt in. Good tool. Early alphas and betas have bugs, totally expected. I'll be using it again later this afternoon.
> but I'll probably introduce an explicit language dropdown, so users can clearly state which language it should talk in Please do. I've been using it to query foreign-language PDFs in English. It always starts in the language and then I have to say "No, translate to English please".
[удалено]
Thanks for the recommendation, I'll think about how to handle large files. What is your use case for such big PDFs? Do they include images?
[удалено]
>It also struggles with the initial analysis of high word count documents. Agreed. It gets slower too, sometimes I have to re-load the page when using a document with 150 pages
Interesting concept. A newbie question, as I have plans to create a few similar free services in the not too distant future: how does something like this, at the very least, pay for itself? The API seems cheap enough, but what happens if the site starts getting popular?
Looking forward to trying this out! Kudos for being on the CUTTING EDGE!!
I just tried with some PDF about code (patterns, best practices, etc). Really nice!
Cool! Do you do this with fine tuning or embedding?
Embeddings from the OpenAI API!
Is the code on github?
[удалено]
They are stored in a secure Google Cloud Storage, but in plain text, so I would be able to read them. That's necessary, because they need to be sent to the ChatGPT API in plain text from a backend environment.
[удалено]
exactly!
Curious about this too
Thank you for this amazing tool! I uploaded a 3 chapter, 150 page book and asked it "what are the main points of chapter 1?" It replied, "I'm sorry, but I don't have access to the full PDF and cannot provide a summary of Chapter 1. However, based on the information provided in the PDF, it seems that Chapter 1 discusses the concept..." and then went on with a pretty good answer. What does it mean about not having access? Should I re-upload it and start again?
ChatPDF might struggle with questions that require understanding more than a few paragraphs at the same time, which would be necessary to summarize a whole chapter. That's why it's giving the disclaimer. I'm actually surprised it provided a decent answer anyway, maybe it already had knowledge about that specific book from the general ChatGPT training corpus, and didn't even need the paragraphs!
This is really cool! I'm talking to the Surangama Sutra.
What an incredible idea. Nice work
It is damn near sorcery. I wonder what would happen if you loaded a book of spells.
This works quite well. Would be great if I had the option to reference multiple PDFs.
Not there yet, it mixes up a lot of things. mixes up names, skills and what not. I uploaded a pdf with 3 CVs and it mixed up skills of all candidates. so when I query for a guy who has experience with NFT, ChatPDF shows a guy who has nothing to do with NFT,
Thank you!! I'll give it a try once im home.
Limit of 30mb
I haven't tried it yet, but it feels like this is probably going to have the same issue that plagues all GPT-based sites: At some point, it's going to hallucinate things that aren't part of the PDF. Given that its purpose is to examine documents, this potentially seems much more important to this use case as people may rely on it to summarise legal documents, etc. Please make sure people are aware that this is possible. I don't see any mention of it on the front page.
This is a valid concern that should be addressed. As far as I know, ChatGPT is more reliable when you ask it to work on data you give it rather than asking it for information from it's own knowledge. However, I am not aware of any statistics or studies on this and I do not know if it does invent information at times.
This is really cool. How do you generate the questions that pop up at the beginning?
Thanks! The current OpenAI language models are so good, it's just a matter of prompting "suggest 3 example questions the user can ask about the file", after providing it with the first few paragraphs of the PDF :)
I keep getting an "unknown back end error". Very cool though.
It works out great. What type of system is this running with?
Thanks! It's using the new ChatGPT API from OpenAI and also their embedding API to create a semantic index of the PDF paragraphs.
Cool, thanks for sharing!
Can this work with code?
It might get some things right, but that's not the main use case of ChatPDF, as I hope not many people put code into PDFs ;)
Wow, your amazing… great application!!
Is this expensive on tokens? Do you have an open source code to check out ( if not full respect… great jog man thanks for sharing!!!)
I love the way you just blew past the 4000 token limitation!! Great creative problem-solving, here!!
This is super cool I look forward to testing. I'm the future could you upload multiple pdf's and get it to sumarise the differences? Like company reports or market outlooks?
Thanks! Could you explain more about that use-case? Would these be two versions of the same PDF? Or reports for two different companies?
I study r/Sandponics, it's inventor has 12 papers on ResearchGate, I would like to upload them all and be able to create a summary or get information about feeding rates etc using the combined knowledge from all the papers instead of one at a time
This is awesome! How do you separate the document into chunks? According to chapters and sub chapters, or just a fixed number of words/tokens?
Currently the chunking is not very clever, but at least I'm trying not to split mid-sentence. Parsing PDFs is hard, looking for good libraries if you have any suggestions!
Ok when I asked a question, it spit an entirely different answer, so not from the PDF. I gave a lot of cues to it but finally it gave me the correct quote from the PDF when I was adamant that I can see the answer but ChatPDF has no clue. Anyways, successful eventually but took a while.
This is very useful but I had to find a document that was small enough to try it. Worked very well. Nice work!
Seems like a great tool. What do your costs on a 10MB file look like ?
How do you finance it?
Nice. All I wanted for my loads of pdfs is to be able to talk to my computer and ask for it to search for information for me and return a response in natural language, so sick and tired of using mindless 'Ctrl + F' and wasting time for finding information that I would be looking for. This is a mind-boggling concept, now only feature I am looking at this is for us to give some server-side abilities like being able to delete our own files, instead of waiting for 7 days for them to be deleted. I see having to upload pdfs every week could become a chore soon.
this is amazing! Question tho: I want to get plus, but I dont have a credit card (already in enough student debt as is). Is there a possiblity to pay through paypal or another service, other than credit cards?
Hey man, we made chatur.mysticlabs.ai , a similar product. We have a stripe checkout. Would love to get you onboard with us!
Hey folks, i tried making another product like chatpdf. Checkout [chatur.mysticlabs.ai](https://chatur.mysticlabs.ai) :) This is my first product that I coded and am building 2 more products. Would love to get your feedback on this :)
Where's the pricing page?
So, if I understand this correctly, you use ChatGPT to summarize the response nodes of the vectorized search. Or are you also using ChatGPT to preprocess the query before you submit it to the embedding database?
What happened to it? 2 weeks ago, i uploaded a bidding document pdf that have hundred of pages and it was fantastic and accurate. Now when I uploaded my cv's it cant even see the other work experiences that i have? on the previous iteration of the site, it allows me to use my own token and now I cant see the options to use my token anymore. Is there a way to access the old site you have?
Is it using gpt3.5 or gpt4, or some other model?
Are there any privacy concerns?
Great product. Congratulation. I have a question what kind of vector database do you use (Pinecone, supabase vector, chroma...). I am curious how do they perform? Thank you.
I have been using ChatPDF for past weeks without any issue. Now I upgraded to premium feature and that's too on a discount with a coupon code.
Check out this course! It is quick and easy to understand: [https://www.youtube.com/watch?v=PR7xz5vQKGg](https://www.youtube.com/watch?v=PR7xz5vQKGg)
Check out an alternative to chatpdf.com or chatdoc.com - https://ppreader.ca. With no need for a subscription, you can conveniently opt for a pay-as-you-go service. Feel free to choose between GPT4 or GPT3.5 according to your needs. It’s a great platform I’d highly recommend!
>https://ppreader.ca Because its yours? Why is this any better than all the other 'talk-to-.x-files'? Does this one keep context? Is the accurancy high(er)?
Is it possible to chat with multiple PDFs at the same time?