j-solorzano t1_jdk2kod wrote on March 24, 2023 at 11:26 PM

Reply to Cuda out of memory error by Rishh3112

If it works in CPU but not GPU, even though the GPU should have more memory, the only difference I can think of is garbage collection timing. Try calling the garbage collector in every epoch. Also, note that you have a GRU, which retains tensors.

j-solorzano t1_jd4qgot wrote on March 21, 2023 at 8:40 PM

Reply to comment by Board_Stock in Alpaca-7B and Dalai, how can I get coherent results? by Haghiri75

Take a look at Langchain and GPTIndex.

j-solorzano t1_jd16nfb wrote on March 21, 2023 at 2:04 AM

Reply to comment by Board_Stock in Alpaca-7B and Dalai, how can I get coherent results? by Haghiri75

Language models don't remember conversations by themselves. You'd have to implement a memory and then add retrieved memories to the prompt.

j-solorzano t1_jd16g7r wrote on March 21, 2023 at 2:03 AM

Reply to Alpaca-7B and Dalai, how can I get coherent results? by Haghiri75

Try adjusting the temperature.

j-solorzano t1_j9i96zm wrote on February 22, 2023 at 3:54 AM

Reply to comment by Blakut in What are more accepted hypotheses that similarly explain the aspects of hominid evolution that the "pseudoscientific" aquatic ape theory does? by KEVLAR60442

Well, there's some webbing there. Other primates have it as well, but only the ones that swim. BTW, swimming and being able to hold your breath is an adaptation.

j-solorzano t1_izyca4d wrote on December 12, 2022 at 7:24 PM

Reply to Priority of data in deep learning? by Sixo60

Without big data there would not be deep learning, just plain old machine learning.