Machine learning helps artificial intelligence recognize patterns, create algorithms and provide models and predictions. To be able to learn and make good decisions, AI needs to be provided with training data material.
Training data might include text, voice or handwriting samples, depending on the purpose of the machine learning system they are collected for.
TEXT
Texts sets providing training material for natural language processing, for instance text classification or automatic translations.
VOICE
Audio samples used for voice recognition, for instance for transcriptions or voice commands systems.
HANDWRITING
Samples of human handwriting used for Optical Character Recognition, enabling and facilitating indexing, editing or searching through handwritten data.
ANNOTATION
Labeling various formats of content (text, image, audio, video) to enable categorization.