Hacker Newsnew | past | comments | ask | show | jobs | submit | patricklef's commentslogin

Must have missed it when we did our research but it looks promising. What does it excel at?


MLLMs are surprisingly bad at this out of the box and to some extent even with fine tuning. https://jina.ai/news/the-what-and-why-of-text-image-modality...


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: