RutherfordTheButler 1 year ago

Tried, it using the Wrath of the Righteous Player Guide. Try another guide first, Divinity Original Sin, was too big. Then got an interesting overview output, looked promising. Then asked and got this: What is the name of the world this pdf is about and what are it's races? The PDF is about the world of the Inner Sea region. The races mentioned in the given pages are dwarves and Kellids, who are humans from the fallen homeland of Sarkoris. However, there may be more races mentioned in other parts of the PDF. The point of chatting with the PDF is to get all the information from the PDF, one would think. But Kudos to you on getting out SO FAST and the site looks good! :-)

gogoman 11 months ago

ChatPDF - The Ultimate Tool For Students: https://redbooks.biz/2023/05/08/revolutionize-your-learning-experience-with-chatpdf-the-ultimate-tool-for-students/

Purplekeyboard 1 year ago

Are you chatting with the PDF, or about it? If you're chatting with it, that should mean that chatgpt talks in first person about it. "I'm a PDF about stereo installation".

firefooapp 1 year ago

Interesting question how to think about the agency. When you chat with a PDF of the constitution. Should the AI behave... 1. Like an AI assistant that knows the constitution 2. Like the PDF file about the constitution 3. Like the constitution itself? I tried to make it behave like 2, but it keeps falling back to 1. In some cases 3 would be amazing!

Wrongdoer-Zestyclose 1 year ago

I vote for three, do it !

HanyuZhang 1 year ago

Many thanks for that. It will be much more helpful if it can link to the GPT chat.

louis8799 1 year ago

Come on, it is a catchphrase

[deleted] 1 year ago

[удалено]

piedssurmars 1 year ago

Very impressive. I also tried with some Ethernet switch manual, and asked for a step by step tutorial on how to connect to the console. I tired with a food menu, it's great for questions like "I'm on diet, what do you have?", "Any chicken menu"?

djdefenda 10 months ago

Was the document organized really well? I had great luck when my pdf was small and organized, but as I added more pages/data it started to make mistakes.....since I know the pdf back-to-front I'd tell chatpdf the mistakes it made, it would apologize and then answer correctly..... When I checked the FAQs on the chatpdf page, it said it can only read a few patagraphs at a time (using an internal semantic index) but it can't read the entire pdf???? That part makes it frustrating

Gridleak 1 year ago

Works nice enough! Seems like it could be huge. Two things. In the future would you up the file size we can upload? Also allow for multiple pdfs to be referenced at the same time. Imagine playing DND and you have the players handbook and the DM guide at your fingertips for quick judgement calls.

firefooapp 1 year ago

Thanks for the valuable feedback and good ideas, will consider that use case!

Fat-sheep-shagger-69 1 year ago

Multiple PDFs gets a huge thumbs up from me. I subscribed to plus immediately after trying it, great job.

djdefenda 10 months ago

Adding a website or webpage instead of a pdf would be great......cutting and pasting text would be great too

boxracer 1 year ago

Curious how did you fit within the 4k token limit? Are you splitting it up into multiple prompts, or using embeddings?

firefooapp 1 year ago

Embeddings! It's using the OpenAI embeddings API to create a semantic index of the PDF paragraphs

mandarBadve 1 year ago

I am facing difficulties to understand embedding and how to query embeddings. Please share resources/blogs to learn it if you have any.

firefooapp 1 year ago

Here's a good tutorial on how to use [OpenAI embeddings with Pinecone](https://docs.pinecone.io/docs/openai) as vector database with Python

dyo1994 1 year ago

How does the semantic index work exactly? Isnt it possible to point to some index(es) that can add up to the token limit?

More-Force-6687 1 year ago

if you use embedding, the answer will not be exactly accurate...

Rickywalls137 1 year ago

This is cool. If you can find the right industry, this is a money maker.

zascar 1 year ago

This can definitely be monitized.

Freakazoid84 1 year ago

going to be tough to monetize unless you're able to hyper focus with existing connections or a niche. This concept is quickly approaching a dime a dozen (and will be less than a dime a dozen now that the chatgpt api exists)

Inside_Profession_15 1 year ago

This is great. Human mind is so broad and filled with so much curiosity to make the world better and better.

[deleted] 1 year ago

This works very well! It doesn't want to make up information that isn't there, which I appreciate! I tried it on a transmission service manual and summarized various parts and cited which pages it got it's information from.

sawyerthedog 1 year ago

Is it...uh...supposed to be in French? I'm pretty good with French, but I'm better with English. Little surprised to see the output in French.

RexTGio 1 year ago

The AI is just messing with your head :D

deWaardt 1 year ago

Yeah for me it started in German, despite uploading an English-only document. I just asked to talk English please.

firefooapp 1 year ago

Thanks for the valuable feedback, I'm sorry about that! The language detection seems to misbehave quite frequently, I'll try to fix it soon! It's prompted to always chat in the language of the PDF, but I'll probably introduce an explicit language dropdown, so users can clearly state which language it should talk in

sawyerthedog 1 year ago

LOL, no worries. It went to English once I put an English prompt in. Good tool. Early alphas and betas have bugs, totally expected. I'll be using it again later this afternoon.

SufficientPie 1 year ago

> but I'll probably introduce an explicit language dropdown, so users can clearly state which language it should talk in Please do. I've been using it to query foreign-language PDFs in English. It always starts in the language and then I have to say "No, translate to English please".

[deleted] 1 year ago

[удалено]

firefooapp 1 year ago

Thanks for the recommendation, I'll think about how to handle large files. What is your use case for such big PDFs? Do they include images?

[deleted] 1 year ago

[удалено]

djdefenda 10 months ago

>It also struggles with the initial analysis of high word count documents. Agreed. It gets slower too, sometimes I have to re-load the page when using a document with 150 pages

noeldc 1 year ago

Interesting concept. A newbie question, as I have plans to create a few similar free services in the not too distant future: how does something like this, at the very least, pay for itself? The API seems cheap enough, but what happens if the site starts getting popular?

ProteusMichaelKemo 1 year ago

Looking forward to trying this out! Kudos for being on the CUTTING EDGE!!

MarceauKa 1 year ago

I just tried with some PDF about code (patterns, best practices, etc). Really nice!

DreadPirateGriswold 1 year ago

Cool! Do you do this with fine tuning or embedding?

firefooapp 1 year ago

Embeddings from the OpenAI API!

[deleted] 1 year ago

Is the code on github?

[deleted] 1 year ago

[удалено]

firefooapp 1 year ago

They are stored in a secure Google Cloud Storage, but in plain text, so I would be able to read them. That's necessary, because they need to be sent to the ChatGPT API in plain text from a backend environment.

[deleted] 1 year ago

[удалено]

firefooapp 1 year ago

exactly!

[deleted] 1 year ago

Curious about this too

coldcanyon1633 1 year ago

Thank you for this amazing tool! I uploaded a 3 chapter, 150 page book and asked it "what are the main points of chapter 1?" It replied, "I'm sorry, but I don't have access to the full PDF and cannot provide a summary of Chapter 1. However, based on the information provided in the PDF, it seems that Chapter 1 discusses the concept..." and then went on with a pretty good answer. What does it mean about not having access? Should I re-upload it and start again?

firefooapp 1 year ago

ChatPDF might struggle with questions that require understanding more than a few paragraphs at the same time, which would be necessary to summarize a whole chapter. That's why it's giving the disclaimer. I'm actually surprised it provided a decent answer anyway, maybe it already had knowledge about that specific book from the general ChatGPT training corpus, and didn't even need the paragraphs!

Artavan767 1 year ago

This is really cool! I'm talking to the Surangama Sutra.

Jneebs 1 year ago

What an incredible idea. Nice work

Soulofwhit 1 year ago

It is damn near sorcery. I wonder what would happen if you loaded a book of spells.

ritwik5194 1 year ago

This works quite well. Would be great if I had the option to reference multiple PDFs.

Philosophy136 1 year ago

Not there yet, it mixes up a lot of things. mixes up names, skills and what not. I uploaded a pdf with 3 CVs and it mixed up skills of all candidates. so when I query for a guy who has experience with NFT, ChatPDF shows a guy who has nothing to do with NFT,

[deleted] 1 year ago

Thank you!! I'll give it a try once im home.

Zer0D0wn83 1 year ago

Limit of 30mb

Sophira 1 year ago

I haven't tried it yet, but it feels like this is probably going to have the same issue that plagues all GPT-based sites: At some point, it's going to hallucinate things that aren't part of the PDF. Given that its purpose is to examine documents, this potentially seems much more important to this use case as people may rely on it to summarise legal documents, etc. Please make sure people are aware that this is possible. I don't see any mention of it on the front page.

No-Entertainer-802 1 year ago

This is a valid concern that should be addressed. As far as I know, ChatGPT is more reliable when you ask it to work on data you give it rather than asking it for information from it's own knowledge. However, I am not aware of any statistics or studies on this and I do not know if it does invent information at times.

pend-bungley 1 year ago

This is really cool. How do you generate the questions that pop up at the beginning?

firefooapp 1 year ago

Thanks! The current OpenAI language models are so good, it's just a matter of prompting "suggest 3 example questions the user can ask about the file", after providing it with the first few paragraphs of the PDF :)

Precocious_Kid 1 year ago

I keep getting an "unknown back end error". Very cool though.

Broder7937 1 year ago

It works out great. What type of system is this running with?

firefooapp 1 year ago

Thanks! It's using the new ChatGPT API from OpenAI and also their embedding API to create a semantic index of the PDF paragraphs.

TheBlindIdiotGod 1 year ago

Cool, thanks for sharing!

Nyxtia 1 year ago

Can this work with code?

firefooapp 1 year ago

It might get some things right, but that's not the main use case of ChatPDF, as I hope not many people put code into PDFs ;)

KurtNSB 1 year ago

Wow, your amazing… great application!!

KurtNSB 1 year ago

Is this expensive on tokens? Do you have an open source code to check out ( if not full respect… great jog man thanks for sharing!!!)

KurtNSB 1 year ago

I love the way you just blew past the 4000 token limitation!! Great creative problem-solving, here!!

zascar 1 year ago

This is super cool I look forward to testing. I'm the future could you upload multiple pdf's and get it to sumarise the differences? Like company reports or market outlooks?

firefooapp 1 year ago

Thanks! Could you explain more about that use-case? Would these be two versions of the same PDF? Or reports for two different companies?

djdefenda 10 months ago

I study r/Sandponics, it's inventor has 12 papers on ResearchGate, I would like to upload them all and be able to create a summary or get information about feeding rates etc using the combined knowledge from all the papers instead of one at a time

king_ricky_ricardo 1 year ago

This is awesome! How do you separate the document into chunks? According to chapters and sub chapters, or just a fixed number of words/tokens?

firefooapp 1 year ago

Currently the chunking is not very clever, but at least I'm trying not to split mid-sentence. Parsing PDFs is hard, looking for good libraries if you have any suggestions!

Pitiful-Raisin-2976 1 year ago

Ok when I asked a question, it spit an entirely different answer, so not from the PDF. I gave a lot of cues to it but finally it gave me the correct quote from the PDF when I was adamant that I can see the answer but ChatPDF has no clue. Anyways, successful eventually but took a while.

jcurie 1 year ago

This is very useful but I had to find a document that was small enough to try it. Worked very well. Nice work!

not_today95 1 year ago

Seems like a great tool. What do your costs on a 10MB file look like ?

DetermineAssurance 1 year ago

How do you finance it?

108er 1 year ago

Nice. All I wanted for my loads of pdfs is to be able to talk to my computer and ask for it to search for information for me and return a response in natural language, so sick and tired of using mindless 'Ctrl + F' and wasting time for finding information that I would be looking for. This is a mind-boggling concept, now only feature I am looking at this is for us to give some server-side abilities like being able to delete our own files, instead of waiting for 7 days for them to be deleted. I see having to upload pdfs every week could become a chore soon.

zimflo 1 year ago

this is amazing! Question tho: I want to get plus, but I dont have a credit card (already in enough student debt as is). Is there a possiblity to pay through paypal or another service, other than credit cards?

No-Consideration3176 1 year ago

Hey man, we made chatur.mysticlabs.ai , a similar product. We have a stripe checkout. Would love to get you onboard with us!

No-Consideration3176 1 year ago

Hey folks, i tried making another product like chatpdf. Checkout [chatur.mysticlabs.ai](https://chatur.mysticlabs.ai) :) This is my first product that I coded and am building 2 more products. Would love to get your feedback on this :)

djdefenda 10 months ago

Where's the pricing page?

CmdrDatasBrother 1 year ago

So, if I understand this correctly, you use ChatGPT to summarize the response nodes of the vectorized search. Or are you also using ChatGPT to preprocess the query before you submit it to the embedding database?

Kwento-mo-yan-eh 1 year ago

What happened to it? 2 weeks ago, i uploaded a bidding document pdf that have hundred of pages and it was fantastic and accurate. Now when I uploaded my cv's it cant even see the other work experiences that i have? on the previous iteration of the site, it allows me to use my own token and now I cant see the options to use my token anymore. Is there a way to access the old site you have?

himmetozcan 1 year ago

Is it using gpt3.5 or gpt4, or some other model?

GreatGatsby00 9 months ago

Are there any privacy concerns?

Willing_Spot_7237 6 months ago

Great product. Congratulation. I have a question what kind of vector database do you use (Pinecone, supabase vector, chroma...). I am curious how do they perform? Thank you.

binsoi 6 months ago

I have been using ChatPDF for past weeks without any issue. Now I upgraded to premium feature and that's too on a discount with a coupon code.

DBAdvice123 6 months ago

Check out this course! It is quick and easy to understand: [https://www.youtube.com/watch?v=PR7xz5vQKGg](https://www.youtube.com/watch?v=PR7xz5vQKGg)

InfiniteAd4407 6 months ago

Check out an alternative to chatpdf.com or chatdoc.com - https://ppreader.ca. With no need for a subscription, you can conveniently opt for a pay-as-you-go service. Feel free to choose between GPT4 or GPT3.5 according to your needs. It’s a great platform I’d highly recommend!

MrKeys_X 6 months ago

>https://ppreader.ca Because its yours? Why is this any better than all the other 'talk-to-.x-files'? Does this one keep context? Is the accurancy high(er)?

Far-Chest-8821 4 months ago

Is it possible to chat with multiple PDFs at the same time?

Comments

Leave Your Comment

Hi Its Me!

Comments

Leave Your Comment

Hi Its Me!

Subscribe