LLMs for text generation

Question

We know that AI is rapidly growing. do we have any large language models (LLMs) to process images, pdf documents directly (fine-tune approach) for text generation tasks?

score 5 · Accepted Answer · answered Mar 14 '24 at 07:44

5

Yes, there are open multimodal LLMs that you can fine-tune yourself, like LlaVa, NextGPT, IDEFICS or SPHINX.

Closed multimodal LLMs like GPT-4v don't offer a way to fine-tune them yet.

answered Mar 14 '24 at 07:44

noe

28,203
1
49
83

LLMs for text generation

1 Answers1