Data and models for Extract+Think, part of "Downscaling Intelligence: Exploring Perception and Reasoning Bottlenecks in Small Multimodal Models"