ntaylor- t1_je11vt1 wrote
Reply to comment by was_der_Fall_ist in [D]GPT-4 might be able to tell you if it hallucinated by Cool_Abbreviations_9
Fairly sure the "final" GPT-4 model still uses a generate function that predicts one token at a time. The training was just good and complicated, via RLHF. After training, it isn't doing any "complicated operations" at inference.
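The loop described above can be sketched as follows. This is a minimal illustration, not GPT-4's actual code: `next_token_logits` is a hypothetical stand-in for the fixed, post-RLHF network, and the tiny vocabulary is made up. The point is that generation is just this loop around a single forward pass per token.

```python
# Toy stand-in for the trained network: given the tokens so far,
# return a score (logit) for each token in a tiny made-up vocabulary.
VOCAB = ["<eos>", "the", "cat", "sat"]

def next_token_logits(tokens):
    # Hypothetical scoring rule purely for illustration: favor the token
    # that follows the last one in VOCAB order, wrapping around to <eos>.
    last = tokens[-1] if tokens else 0
    return [1.0 if i == (last + 1) % len(VOCAB) else 0.0
            for i in range(len(VOCAB))]

def generate(prompt_tokens, max_new_tokens=10):
    # Autoregressive decoding: one forward pass, one new token, repeat.
    tokens = list(prompt_tokens)
    for _ in range(max_new_tokens):
        logits = next_token_logits(tokens)
        next_id = max(range(len(logits)), key=logits.__getitem__)  # greedy argmax
        tokens.append(next_id)
        if next_id == 0:  # <eos> ends generation
            break
    return tokens
```

Whatever RLHF did to the weights, the generate-side machinery stays this simple: score, pick, append, repeat.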
ntaylor- t1_je11iqf wrote
Reply to comment by astrange in [D]GPT-4 might be able to tell you if it hallucinated by Cool_Abbreviations_9
But eventually, after RLHF, GPT-4 is one final fixed model, and it still presumably uses a generate function that predicts the next token based on the previous ones, just like base GPT models or any other autoregressive model. At least that's what it appears to be doing.
ntaylor- t1_j8k7v8y wrote
Reply to comment by bik1230 in [P] Introducing arxivGPT: chrome extension that summarizes arxived research papers using chatGPT by _sshin_
I had the same thought... I'm fairly sure any GPT-based model can only handle 4k tokens.
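The practical consequence of a fixed context window is that anything sending a long paper to the model has to trim or chunk it first. A minimal sketch of the trimming step, assuming tokens are already counted with the model's own tokenizer (real code would use something like OpenAI's tiktoken; the budget check itself is the point here):

```python
def truncate_to_budget(tokens, max_tokens=4096):
    """Drop the oldest tokens so the sequence fits the model's context window.

    The 4096 default matches the ~4k-token limit discussed above; keeping the
    most recent tokens is one common choice, not the only one.
    """
    if len(tokens) <= max_tokens:
        return tokens
    return tokens[-max_tokens:]
```

A summarizer for arXiv papers would typically go further and chunk the document, summarizing piece by piece, since most papers are far longer than 4k tokens.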
ntaylor- t1_je5qtl2 wrote
Reply to comment by was_der_Fall_ist in [D]GPT-4 might be able to tell you if it hallucinated by Cool_Abbreviations_9
Nope. It's the same as any neural network using the transformer architecture: just a big old series of matrix multiplications with some non-linear transformations, at the end of the day.
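To make "matrix multiplications with non-linear transformations" concrete, here is the shape of one transformer-style feed-forward block in plain Python (real implementations use optimized tensor libraries and add attention, normalization, and residual connections on top, but the arithmetic core is exactly this):

```python
def matmul(A, B):
    # Plain-Python matrix multiply: (n x k) @ (k x m) -> (n x m).
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)]
            for row in A]

def relu(M):
    # Elementwise non-linearity applied between the linear layers.
    return [[max(0.0, v) for v in row] for row in M]

def feed_forward(x, W1, W2):
    # One MLP block as found in each transformer layer:
    # linear -> non-linearity -> linear.
    return matmul(relu(matmul(x, W1)), W2)
```

Stack enough of these (interleaved with attention, which is itself more matrix multiplies plus a softmax) and you have the whole model; nothing extra appears at inference time.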