Hou, J., Tan, Z., Hu, Q., Wang, P. and Gong, Y., 2025. Multimodal hierarchical classification using cascade-of-thought. Information Processing & Management, 63 (3), 104555.
Full text available as:
PDF: Multimodal hierarchical classification using cascade-of-thought.pdf (Accepted Version, 926kB)
Restricted to Repository staff only until 21 December 2027. Available under License Creative Commons Attribution Non-commercial No Derivatives.
Copyright to original material in this document is with the original owner(s). Access to this content through BURO is granted on condition that you use it only for research, scholarly or other non-commercial purposes. If you wish to use it for any other purposes, you must contact BU via BURO@bournemouth.ac.uk. Any third party copyright material in this document remains the property of its respective owner(s). BU grants no licence for further use of that third party material.
DOI: 10.1016/j.ipm.2025.104555
Abstract
We propose Cascade-of-Thought (CSOT), a novel prompt-based method for multimodal hierarchical classification (MHC) that requires no training or labeled exemplars. Inspired by the LLM-as-a-Judge (LaaJ) paradigm, CSOT decomposes classification into rationale generation, confidence scoring, and decision ranking—each implemented via structured prompts to a vision–language model (VLM). Experiments on two public MHC benchmarks demonstrate that CSOT yields substantial performance gains, particularly for weaker VLMs, while also enhancing the output quality of near-ceiling models. CSOT offers a flexible, generalizable solution for real-world MHC tasks.
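The three stages named in the abstract (rationale generation, confidence scoring, decision ranking) can be illustrated with a minimal sketch. It is not the authors' implementation: the full text is restricted, so the prompt wording, the top-down walk over the label hierarchy, the 0 to 100 score scale, and every name here (`cascade_classify`, `parse_score`, `stub_vlm`, the `hierarchy` dictionary) are assumptions made for illustration. The VLM is modelled as any callable that takes an image reference and a prompt and returns text. The ranking stage is simplified to a local sort, whereas the abstract states that each stage is implemented via a structured prompt.

```python
import re
from typing import Callable, Dict, List

# Assumed interface: a VLM is any callable (image_ref, prompt) -> reply text.
VLM = Callable[[str, str], str]


def parse_score(reply: str) -> float:
    """Take the first number in the reply and clamp it to [0, 100]; 0.0 if none."""
    match = re.search(r"\d+(?:\.\d+)?", reply)
    if match is None:
        return 0.0
    return min(100.0, max(0.0, float(match.group())))


def cascade_classify(vlm: VLM, image: str, text: str,
                     hierarchy: Dict[str, List[str]],
                     root: str = "root") -> List[str]:
    """Walk the hierarchy top-down, choosing one child label per level."""
    path: List[str] = []
    node = root
    while hierarchy.get(node):  # stop at a leaf (no children listed)
        scored = []
        for label in hierarchy[node]:
            # Stage 1: rationale generation (hypothetical prompt wording).
            rationale = vlm(image, f"Item text: {text}\n"
                                   f"Explain briefly why this item could belong to '{label}'.")
            # Stage 2: confidence scoring conditioned on that rationale.
            reply = vlm(image, f"Rationale: {rationale}\n"
                               f"Rate from 0 to 100 how well '{label}' fits. "
                               f"Answer with a number only.")
            scored.append((parse_score(reply), label))
        # Stage 3: decision ranking, simplified here to a local sort by score.
        scored.sort(key=lambda pair: pair[0], reverse=True)
        node = scored[0][1]
        path.append(node)
    return path


def stub_vlm(image: str, prompt: str) -> str:
    """Deterministic stand-in for a real model, for trying the sketch offline."""
    if prompt.startswith("Rationale"):
        scores = {"animal": "90", "vehicle": "10", "cat": "20", "dog": "75"}
        for label, score in scores.items():
            if f"'{label}'" in prompt:
                return score
        return "0"
    return "It shares visual and textual cues with that category."
```

With a real model, `stub_vlm` would be swapped for a function that sends the image and prompt to the chosen VLM. Because no training or labeled exemplars are involved, the only inputs are the item itself and the label hierarchy.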
| Item Type: | Article |
|---|---|
| ISSN: | 0306-4573 |
| Uncontrolled Keywords: | Multimodal hierarchical classification; Vision language model; Multimodal reasoning; Zero-shot inference; LLM-as-a-Judge |
| Group: | Faculty of Media, Science and Technology |
| ID Code: | 41866 |
| Deposited By: | Symplectic RT2 |
| Deposited On: | 30 Mar 2026 15:13 |
| Last Modified: | 30 Mar 2026 15:13 |