some tests here, keeping same data, recipe, LLM but swap out the vision encoder



they compared with ViT‑L/14 and SigLIP‑SO400, a fully convolutional ConvNeXT, and hybrid FastViT models

FastViT is like 8× smaller and 20× faster than ViT‑L/14 while staying just as smart
SWAP-0.5%
VSN3.49%
post-image
This page may contain third-party content, which is provided for information purposes only (not representations/warranties) and should not be considered as an endorsement of its views by Gate, nor as financial or professional advice. See Disclaimer for details.
  • Reward
  • 6
  • Repost
  • Share
Comment
0/400
ForkMongervip
· 7h ago
The speed improvement is really amazing!
View OriginalReply0
PretendingToReadDocsvip
· 7h ago
Small and fast are worth studying.
View OriginalReply0
MissedTheBoatvip
· 7h ago
The cost performance is really high.
View OriginalReply0
SilentObservervip
· 7h ago
The speed has increased a lot.
View OriginalReply0
OfflineValidatorvip
· 7h ago
The performance improvement is really good.
View OriginalReply0
GasFeeLovervip
· 7h ago
Performance doubled and efficiency improved
View OriginalReply0
Trade Crypto Anywhere Anytime
qrCode
Scan to download Gate App
Community
English
  • 简体中文
  • English
  • Tiếng Việt
  • 繁體中文
  • Español
  • Русский
  • Français (Afrique)
  • Português (Portugal)
  • Bahasa Indonesia
  • 日本語
  • بالعربية
  • Українська
  • Português (Brasil)