Natural Language
- What is the Natural Language API? The Natural Language API has several methods for performing analysis and annotation on your text. Each level of analysis provides valuable information for language understanding.
- What five methods are in the Natural Language API? Sentiment Analysis, Entity Analysis, Entity Sentiment Analysis, Syntactic Analysis, Content Classification
- What is entity sentiment analysis? Entity sentiment analysis inspects the given text for known entities (proper nouns and common nouns), returns information about those entities, and identifies the prevailing emotional opinion about each entity within the text, especially to determine whether the writer's attitude toward the entity is positive, negative, or neutral.
- What is syntactic analysis? Syntactic analysis extracts linguistic information by breaking up the given text into a series of sentences and tokens (generally, word boundaries) and providing further analysis on those tokens.
- How does the Natural Language API detect the language? If the caller does not specify a language in the initial request, each API call detects the language automatically and returns it.
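
As a rough illustration of the cards above, the sketch below builds the REST URL and JSON body for each of the five analysis methods without sending anything over the network. The helper name `build_request` and the sample sentence are made up for this example. The endpoint paths and field names are taken from the public v1 REST reference as best I recall it, so treat them as assumptions and check the current documentation. Leaving `language` out of the document is what lets the API detect the language itself.

```python
import json

# Flashcard name -> REST method on the v1 `documents` resource.
# Mapping assumed from the public REST reference; verify before use.
METHODS = {
    "Sentiment Analysis": "analyzeSentiment",
    "Entity Analysis": "analyzeEntities",
    "Entity Sentiment Analysis": "analyzeEntitySentiment",
    "Syntactic Analysis": "analyzeSyntax",
    "Content Classification": "classifyText",
}

BASE_URL = "https://language.googleapis.com/v1/documents"


def build_request(analysis, text, language=None):
    """Return (url, body) for one analysis. Hypothetical helper, no network I/O."""
    if analysis not in METHODS:
        raise ValueError(f"unknown analysis: {analysis!r}")
    document = {"type": "PLAIN_TEXT", "content": text}
    if language is not None:
        # Set only when the caller knows the language;
        # when omitted, the API detects and returns it.
        document["language"] = language
    return f"{BASE_URL}:{METHODS[analysis]}", {"document": document}


# Example: entity sentiment request with automatic language detection.
url, body = build_request(
    "Entity Sentiment Analysis",
    "The pizza was great but the service was slow.",
)
print(url)
print(json.dumps(body, indent=2))
```

Sending the request would additionally need authentication (an API key or OAuth token), which is deliberately left out here.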
Video Intelligence
- What are the two components of Video Intelligence? AutoML Video Intelligence and Video Intelligence API
- What is AutoML Video Intelligence? AutoML Video Intelligence provides a graphical interface that makes it easy to train your own custom models to classify and track objects within videos, even if you have minimal machine learning experience.
- What is the Video Intelligence API? The Video Intelligence API has pre-trained machine learning models that automatically recognize a vast number of objects, places, and actions in stored and streaming video.
- What features are in the Video Intelligence API? Explicit content detection, face detection, label detection, logo recognition, object tracking, person detection, shot change detection, speech transcription, text detection
- What is explicit content detection? Explicit content detection annotates a video with explicit content annotations (tags) for entities that are detected in the video or video segments provided.
- What is face detection? Face detection looks for faces in a video and returns the segments in which a face is detected, optionally with bounding boxes and face attributes (headwear, looking at the camera, eyes visible, glasses, mouth open, smiling).
- What is label detection? Label detection identifies objects, locations, activities, animal species, products, and more. Unlike object tracking, it provides labels for the entire frame, without bounding boxes.
- What is object tracking? Object tracking can track multiple objects detected in an input video or video segments and return labels (tags) associated with the detected entities along with the location of the entity in the frame.
- What is shot change detection? Shot change detection annotates a video with video segments that are generated when detecting abrupt shot changes in the video.
- What is speech transcription? Speech transcription transcribes spoken audio in a video or video segment into text and returns blocks of text for each portion of the transcribed audio.
- What is text detection? Text detection performs Optical Character Recognition (OCR) to detect visible text from frames in a video, or video segments, and returns the detected text along with information about the frame-level location and timestamp in the video for that text.
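
To make the feature list above more concrete, here is a minimal sketch that turns flashcard feature names into the JSON body of a `videos:annotate` request. Nothing is sent over the network. The helper `build_annotate_request` and the `gs://example-bucket/...` URI are hypothetical; the endpoint, the `inputUri` and `features` fields, and the enum names are assumed from the public v1 REST reference and should be verified against the current documentation.

```python
import json

# Flashcard feature name -> `features` enum value in the v1 REST API.
# Enum names assumed from the public reference; verify before use.
FEATURES = {
    "explicit content detection": "EXPLICIT_CONTENT_DETECTION",
    "face detection": "FACE_DETECTION",
    "label detection": "LABEL_DETECTION",
    "logo recognition": "LOGO_RECOGNITION",
    "object tracking": "OBJECT_TRACKING",
    "person detection": "PERSON_DETECTION",
    "shot change detection": "SHOT_CHANGE_DETECTION",
    "speech transcription": "SPEECH_TRANSCRIPTION",
    "text detection": "TEXT_DETECTION",
}

ANNOTATE_URL = "https://videointelligence.googleapis.com/v1/videos:annotate"


def build_annotate_request(input_uri, feature_names):
    """Return the JSON body for one annotate call. Hypothetical helper, no network I/O."""
    unknown = [name for name in feature_names if name not in FEATURES]
    if unknown:
        raise ValueError(f"unknown features: {unknown}")
    return {
        "inputUri": input_uri,
        "features": [FEATURES[name] for name in feature_names],
    }


# Example: ask for frame-level labels and tracked objects in one request.
body = build_annotate_request(
    "gs://example-bucket/example-video.mp4",  # placeholder URI
    ["label detection", "object tracking"],
)
print(ANNOTATE_URL)
print(json.dumps(body, indent=2))
```

Some features need extra settings that this sketch leaves out; speech transcription, for example, also expects a language code in the request's video context.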
Speech-to-Text
- What are the three methods to perform speech recognition? Synchronous recognition, asynchronous recognition, streaming recognition.
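
The three methods differ mainly in how results come back: synchronous recognition returns the transcript in the response itself, asynchronous recognition starts a long-running operation that is polled for results, and streaming recognition runs over a bidirectional gRPC stream. The sketch below maps the first two to their v1 REST endpoints and builds a request body, with no network calls. The helper `build_recognize_request` and the `gs://` URI are hypothetical, and the endpoint paths and field names are assumed from the public REST reference.

```python
# Recognition method -> v1 REST endpoint (assumed from the public reference).
ENDPOINTS = {
    "synchronous": "https://speech.googleapis.com/v1/speech:recognize",
    "asynchronous": "https://speech.googleapis.com/v1/speech:longrunningrecognize",
    "streaming": None,  # available over gRPC only, so there is no REST URL
}


def build_recognize_request(method, audio_uri, language_code="en-US"):
    """Return (url, body) for a REST recognition call. Hypothetical helper, no network I/O."""
    if method not in ENDPOINTS:
        raise ValueError(f"unknown method: {method!r}")
    url = ENDPOINTS[method]
    if url is None:
        raise NotImplementedError("streaming recognition needs a gRPC client")
    body = {
        "config": {"languageCode": language_code},
        "audio": {"uri": audio_uri},
    }
    return url, body


# Example: a long recording is a natural fit for asynchronous recognition.
url, body = build_recognize_request(
    "asynchronous",
    "gs://example-bucket/example-audio.flac",  # placeholder URI
)
print(url)
print(body)
```

The synchronous and asynchronous calls share the same body shape here, so switching between them is only a change of endpoint and of how the caller waits for the result.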