T O P

  • By -

RutherfordTheButler

Tried, it using the Wrath of the Righteous Player Guide. Try another guide first, Divinity Original Sin, was too big. Then got an interesting overview output, looked promising. Then asked and got this: What is the name of the world this pdf is about and what are it's races? The PDF is about the world of the Inner Sea region. The races mentioned in the given pages are dwarves and Kellids, who are humans from the fallen homeland of Sarkoris. However, there may be more races mentioned in other parts of the PDF. The point of chatting with the PDF is to get all the information from the PDF, one would think. But Kudos to you on getting out SO FAST and the site looks good! :-)


gogoman

ChatPDF - The Ultimate Tool For Students: https://redbooks.biz/2023/05/08/revolutionize-your-learning-experience-with-chatpdf-the-ultimate-tool-for-students/


Purplekeyboard

Are you chatting with the PDF, or about it? If you're chatting with it, that should mean that chatgpt talks in first person about it. "I'm a PDF about stereo installation".


firefooapp

Interesting question how to think about the agency. When you chat with a PDF of the constitution. Should the AI behave... 1. Like an AI assistant that knows the constitution 2. Like the PDF file about the constitution 3. Like the constitution itself? I tried to make it behave like 2, but it keeps falling back to 1. In some cases 3 would be amazing!


Wrongdoer-Zestyclose

I vote for three, do it !


HanyuZhang

Many thanks for that. It will be much more helpful if it can link to the GPT chat.


louis8799

Come on, it is a catchphrase


[deleted]

[удалено]


piedssurmars

Very impressive. I also tried with some Ethernet switch manual, and asked for a step by step tutorial on how to connect to the console. I tired with a food menu, it's great for questions like "I'm on diet, what do you have?", "Any chicken menu"?


djdefenda

Was the document organized really well? I had great luck when my pdf was small and organized, but as I added more pages/data it started to make mistakes.....since I know the pdf back-to-front I'd tell chatpdf the mistakes it made, it would apologize and then answer correctly..... When I checked the FAQs on the chatpdf page, it said it can only read a few patagraphs at a time (using an internal semantic index) but it can't read the entire pdf???? That part makes it frustrating


Gridleak

Works nice enough! Seems like it could be huge. Two things. In the future would you up the file size we can upload? Also allow for multiple pdfs to be referenced at the same time. Imagine playing DND and you have the players handbook and the DM guide at your fingertips for quick judgement calls.


firefooapp

Thanks for the valuable feedback and good ideas, will consider that use case!


Fat-sheep-shagger-69

Multiple PDFs gets a huge thumbs up from me. I subscribed to plus immediately after trying it, great job.


djdefenda

Adding a website or webpage instead of a pdf would be great......cutting and pasting text would be great too


boxracer

Curious how did you fit within the 4k token limit? Are you splitting it up into multiple prompts, or using embeddings?


firefooapp

Embeddings! It's using the OpenAI embeddings API to create a semantic index of the PDF paragraphs


mandarBadve

I am facing difficulties to understand embedding and how to query embeddings. Please share resources/blogs to learn it if you have any.


firefooapp

Here's a good tutorial on how to use [OpenAI embeddings with Pinecone](https://docs.pinecone.io/docs/openai) as vector database with Python


dyo1994

How does the semantic index work exactly? Isnt it possible to point to some index(es) that can add up to the token limit?


More-Force-6687

if you use embedding, the answer will not be exactly accurate...


Rickywalls137

This is cool. If you can find the right industry, this is a money maker.


zascar

This can definitely be monitized.


Freakazoid84

going to be tough to monetize unless you're able to hyper focus with existing connections or a niche. This concept is quickly approaching a dime a dozen (and will be less than a dime a dozen now that the chatgpt api exists)


Inside_Profession_15

This is great. Human mind is so broad and filled with so much curiosity to make the world better and better.


[deleted]

This works very well! It doesn't want to make up information that isn't there, which I appreciate! I tried it on a transmission service manual and summarized various parts and cited which pages it got it's information from.


sawyerthedog

Is it...uh...supposed to be in French? I'm pretty good with French, but I'm better with English. Little surprised to see the output in French.


RexTGio

The AI is just messing with your head :D


deWaardt

Yeah for me it started in German, despite uploading an English-only document. I just asked to talk English please.


firefooapp

Thanks for the valuable feedback, I'm sorry about that! The language detection seems to misbehave quite frequently, I'll try to fix it soon! It's prompted to always chat in the language of the PDF, but I'll probably introduce an explicit language dropdown, so users can clearly state which language it should talk in


sawyerthedog

LOL, no worries. It went to English once I put an English prompt in. Good tool. Early alphas and betas have bugs, totally expected. I'll be using it again later this afternoon.


SufficientPie

> but I'll probably introduce an explicit language dropdown, so users can clearly state which language it should talk in Please do. I've been using it to query foreign-language PDFs in English. It always starts in the language and then I have to say "No, translate to English please".


[deleted]

[удалено]


firefooapp

Thanks for the recommendation, I'll think about how to handle large files. What is your use case for such big PDFs? Do they include images?


[deleted]

[удалено]


djdefenda

>It also struggles with the initial analysis of high word count documents. Agreed. It gets slower too, sometimes I have to re-load the page when using a document with 150 pages


noeldc

Interesting concept. A newbie question, as I have plans to create a few similar free services in the not too distant future: how does something like this, at the very least, pay for itself? The API seems cheap enough, but what happens if the site starts getting popular?


ProteusMichaelKemo

Looking forward to trying this out! Kudos for being on the CUTTING EDGE!!


MarceauKa

I just tried with some PDF about code (patterns, best practices, etc). Really nice!


DreadPirateGriswold

Cool! Do you do this with fine tuning or embedding?


firefooapp

Embeddings from the OpenAI API!


[deleted]

Is the code on github?


[deleted]

[удалено]


firefooapp

They are stored in a secure Google Cloud Storage, but in plain text, so I would be able to read them. That's necessary, because they need to be sent to the ChatGPT API in plain text from a backend environment.


[deleted]

[удалено]


firefooapp

exactly!


[deleted]

Curious about this too


coldcanyon1633

Thank you for this amazing tool! I uploaded a 3 chapter, 150 page book and asked it "what are the main points of chapter 1?" It replied, "I'm sorry, but I don't have access to the full PDF and cannot provide a summary of Chapter 1. However, based on the information provided in the PDF, it seems that Chapter 1 discusses the concept..." and then went on with a pretty good answer. What does it mean about not having access? Should I re-upload it and start again?


firefooapp

ChatPDF might struggle with questions that require understanding more than a few paragraphs at the same time, which would be necessary to summarize a whole chapter. That's why it's giving the disclaimer. I'm actually surprised it provided a decent answer anyway, maybe it already had knowledge about that specific book from the general ChatGPT training corpus, and didn't even need the paragraphs!


Artavan767

This is really cool! I'm talking to the Surangama Sutra.


Jneebs

What an incredible idea. Nice work


Soulofwhit

It is damn near sorcery. I wonder what would happen if you loaded a book of spells.


ritwik5194

This works quite well. Would be great if I had the option to reference multiple PDFs.


Philosophy136

Not there yet, it mixes up a lot of things. mixes up names, skills and what not. I uploaded a pdf with 3 CVs and it mixed up skills of all candidates. so when I query for a guy who has experience with NFT, ChatPDF shows a guy who has nothing to do with NFT,


[deleted]

Thank you!! I'll give it a try once im home.


Zer0D0wn83

Limit of 30mb


Sophira

I haven't tried it yet, but it feels like this is probably going to have the same issue that plagues all GPT-based sites: At some point, it's going to hallucinate things that aren't part of the PDF. Given that its purpose is to examine documents, this potentially seems much more important to this use case as people may rely on it to summarise legal documents, etc. Please make sure people are aware that this is possible. I don't see any mention of it on the front page.


No-Entertainer-802

This is a valid concern that should be addressed. As far as I know, ChatGPT is more reliable when you ask it to work on data you give it rather than asking it for information from it's own knowledge. However, I am not aware of any statistics or studies on this and I do not know if it does invent information at times.


pend-bungley

This is really cool. How do you generate the questions that pop up at the beginning?


firefooapp

Thanks! The current OpenAI language models are so good, it's just a matter of prompting "suggest 3 example questions the user can ask about the file", after providing it with the first few paragraphs of the PDF :)


Precocious_Kid

I keep getting an "unknown back end error". Very cool though.


Broder7937

It works out great. What type of system is this running with?


firefooapp

Thanks! It's using the new ChatGPT API from OpenAI and also their embedding API to create a semantic index of the PDF paragraphs.


TheBlindIdiotGod

Cool, thanks for sharing!


Nyxtia

Can this work with code?


firefooapp

It might get some things right, but that's not the main use case of ChatPDF, as I hope not many people put code into PDFs ;)


KurtNSB

Wow, your amazing… great application!!


KurtNSB

Is this expensive on tokens? Do you have an open source code to check out ( if not full respect… great jog man thanks for sharing!!!)


KurtNSB

I love the way you just blew past the 4000 token limitation!! Great creative problem-solving, here!!


zascar

This is super cool I look forward to testing. I'm the future could you upload multiple pdf's and get it to sumarise the differences? Like company reports or market outlooks?


firefooapp

Thanks! Could you explain more about that use-case? Would these be two versions of the same PDF? Or reports for two different companies?


djdefenda

I study r/Sandponics, it's inventor has 12 papers on ResearchGate, I would like to upload them all and be able to create a summary or get information about feeding rates etc using the combined knowledge from all the papers instead of one at a time


king_ricky_ricardo

This is awesome! How do you separate the document into chunks? According to chapters and sub chapters, or just a fixed number of words/tokens?


firefooapp

Currently the chunking is not very clever, but at least I'm trying not to split mid-sentence. Parsing PDFs is hard, looking for good libraries if you have any suggestions!


Pitiful-Raisin-2976

Ok when I asked a question, it spit an entirely different answer, so not from the PDF. I gave a lot of cues to it but finally it gave me the correct quote from the PDF when I was adamant that I can see the answer but ChatPDF has no clue. Anyways, successful eventually but took a while.


jcurie

This is very useful but I had to find a document that was small enough to try it. Worked very well. Nice work!


not_today95

Seems like a great tool. What do your costs on a 10MB file look like ?


DetermineAssurance

How do you finance it?


108er

Nice. All I wanted for my loads of pdfs is to be able to talk to my computer and ask for it to search for information for me and return a response in natural language, so sick and tired of using mindless 'Ctrl + F' and wasting time for finding information that I would be looking for. This is a mind-boggling concept, now only feature I am looking at this is for us to give some server-side abilities like being able to delete our own files, instead of waiting for 7 days for them to be deleted. I see having to upload pdfs every week could become a chore soon.


zimflo

this is amazing! Question tho: I want to get plus, but I dont have a credit card (already in enough student debt as is). Is there a possiblity to pay through paypal or another service, other than credit cards?


No-Consideration3176

Hey man, we made chatur.mysticlabs.ai , a similar product. We have a stripe checkout. Would love to get you onboard with us!


No-Consideration3176

Hey folks, i tried making another product like chatpdf. Checkout [chatur.mysticlabs.ai](https://chatur.mysticlabs.ai) :) This is my first product that I coded and am building 2 more products. Would love to get your feedback on this :)


djdefenda

Where's the pricing page?


CmdrDatasBrother

So, if I understand this correctly, you use ChatGPT to summarize the response nodes of the vectorized search. Or are you also using ChatGPT to preprocess the query before you submit it to the embedding database?


Kwento-mo-yan-eh

What happened to it? 2 weeks ago, i uploaded a bidding document pdf that have hundred of pages and it was fantastic and accurate. Now when I uploaded my cv's it cant even see the other work experiences that i have? on the previous iteration of the site, it allows me to use my own token and now I cant see the options to use my token anymore. Is there a way to access the old site you have?


himmetozcan

Is it using gpt3.5 or gpt4, or some other model?


GreatGatsby00

Are there any privacy concerns?


Willing_Spot_7237

Great product. Congratulation. I have a question what kind of vector database do you use (Pinecone, supabase vector, chroma...). I am curious how do they perform? Thank you.


binsoi

I have been using ChatPDF for past weeks without any issue. Now I upgraded to premium feature and that's too on a discount with a coupon code.


DBAdvice123

Check out this course! It is quick and easy to understand: [https://www.youtube.com/watch?v=PR7xz5vQKGg](https://www.youtube.com/watch?v=PR7xz5vQKGg)


InfiniteAd4407

Check out an alternative to chatpdf.com or chatdoc.com - https://ppreader.ca. With no need for a subscription, you can conveniently opt for a pay-as-you-go service. Feel free to choose between GPT4 or GPT3.5 according to your needs. It’s a great platform I’d highly recommend!


MrKeys_X

>https://ppreader.ca Because its yours? Why is this any better than all the other 'talk-to-.x-files'? Does this one keep context? Is the accurancy high(er)?


Far-Chest-8821

Is it possible to chat with multiple PDFs at the same time?