Subscribe to Updates
Get the latest creative news from FooBar about art, design and business.
Browsing: Multimodal
arXiv:2603.12249v1 Announce Type: cross Abstract: Constructing scientific multimodal document reasoning datasets for foundation model training involves an inherent trade-off among…
[Submitted on 28 Jul 2025 (v1), last revised 10 Mar 2026 (this version, v2)] View a PDF of the paper…
[Submitted on 27 Apr 2025 (v1), last revised 3 Mar 2026 (this version, v5)] Authors:Weidi Luo, Tianyu Lu, Qiming Zhang,…
arXiv:2603.02200v1 Announce Type: cross Abstract: The deployment of multimodal models in high-stakes domains, such as self-driving vehicles and medical diagnostics,…
[Submitted on 7 Aug 2025 (v1), last revised 23 Feb 2026 (this version, v5)] View a PDF of the paper…
arXiv:2602.08282v1 Announce Type: cross Abstract: Large-scale, cross-species plant distribution prediction plays a crucial role in biodiversity conservation, yet modeling efforts…
arXiv:2602.08868v1 Announce Type: cross Abstract: Time-series anomaly detection (TSAD) with multimodal large language models (MLLMs) is an emerging area, yet…
[Submitted on 10 Jun 2025 (v1), last revised 27 Jan 2026 (this version, v2)] View a PDF of the paper…
Full-text links: Access Paper: View a PDF of the paper titled COSINT-Agent: A Knowledge-Driven Multimodal Agent for Chinese Open Source…
[Submitted on 18 Oct 2025 (v1), last revised 19 Dec 2025 (this version, v2)] View a PDF of the paper…

