Invited Talk
Workshop: 1st Workshop on Multimodal Content Moderation

Data Collection for Content Moderation


Data collection and curation are an integral yet often overlooked part of building content moderation systems. In this presentation we'll discuss optimizing data annotation, the effects of data quality and quantity on overall model performance, techniques for identifying and alleviating biases in models, and appropriate applications of synthetic data.

