Empty-Revolution7570 OP t1_jcfx1wv wrote on March 16, 2023 at 3:29 PM Reply to comment by ml_head in [P] Multimedia GPT: Can ChatGPT/GPT-4 be used for vision / audio tasks just by prompt engineering? by Empty-Revolution7570 That makes sense! Based on how it works it think original stories would also work. Permalink Parent 1
Empty-Revolution7570 OP t1_jcdv1nt wrote on March 16, 2023 at 2:53 AM Reply to comment by MysteryInc152 in [P] Multimedia GPT: Can ChatGPT/GPT-4 be used for vision / audio tasks just by prompt engineering? by Empty-Revolution7570 No, it understands image through other models on hugging face, and outputs image with diffusers or OpenAI dalle Permalink Parent 1
Empty-Revolution7570 OP t1_jcdtuff wrote on March 16, 2023 at 2:43 AM Reply to comment by MysteryInc152 in [P] Multimedia GPT: Can ChatGPT/GPT-4 be used for vision / audio tasks just by prompt engineering? by Empty-Revolution7570 Yes, I included all the VFMs. I added upon those a few more, such as OpenAI Whisper. Still exploring how to incorporate video models Permalink Parent 1
[P] Multimedia GPT: Can ChatGPT/GPT-4 be used for vision / audio tasks just by prompt engineering? Submitted by Empty-Revolution7570 t3_11sfj5s on March 16, 2023 at 1:07 AM in MachineLearning 7 comments 1
Empty-Revolution7570 OP t1_jcfx1wv wrote
Reply to comment by ml_head in [P] Multimedia GPT: Can ChatGPT/GPT-4 be used for vision / audio tasks just by prompt engineering? by Empty-Revolution7570
That makes sense!
Based on how it works it think original stories would also work.