📣 Creators, Exciting News!
Gate Square Certified Creator Application Is Now Live!
How to apply:
1️⃣ Open App → Tap [Square] at the bottom → Click your avatar in the top right
2️⃣ Tap [Get Certified] under your avatar
3️⃣ Once approved, you’ll get an exclusive verified badge that highlights your credibility and expertise!
Note: You need to update App to version 7.25.0 or above to apply.
The application channel is now open to KOLs, project teams, media, and business partners!
Super low threshold, just 500 followers + active posting to apply!
At Gate Square, everyone can be a community leader! �
ByteDance and USTC jointly proposed DocPedia, a large multimodal document model
DocPedia, a multi-modal document model jointly developed by ByteDance and the University of Science and Technology of China, has successfully broken through the limit of resolution and reached a high resolution of 2560×2560, while the industry’s advanced multi-modal large models such as LLaVA and MiniGPT-4 process images with a resolution of 336×336, which cannot parse high-resolution document images. The result is that the research team has adopted a new approach to address the shortcomings of existing models in parsing high-resolution document images.
It is said that DocPedia can not only accurately identify image information, but also call the knowledge base to answer questions based on user needs, demonstrating the ability to understand high-resolution multimodal documents.