This appendix gives implementation details for the hateful meme detection models, reports further insights from the ablation studies, presents visual comparisons between Pro-Cap and the basic PromptHate, and shows results that highlight the impact of using answers from a single probing question. Together, these findings suggest directions for optimizing meme detection models.