Zero-shot Multimodal Document Retrieval via Cross-modal Question Generation