Partition an audio frame into segments according to the identity of the person speaking. This service provides facilities to determine at which times each person is speaking and the gender of each speaker.
Transcribe the speech of an audio frame. A domain specific vocabulary can be provided to improve the results.
Provide correspondence between a given text and an audio frame. This service returns the begin and end positions of each segment found in the two mediums.
Find the temporal positions of the visual transitions between the shots present in a video.
Identifies the face poses in a video and their orientation (pose in front view, profile or other).
Beginning of the relationship between CRIM and LEADS: an employee of CRIM is on the advisory committee of LEADS.
First project between CRIM and LEADS which assessed CRIM's technologies with LEADS data. This project laid the foundation for the definition of VESTA's requirements.
CRIM receives funding from CANARIE's Research Software Program to develop the VESTA platform from LEADS requirements.
A beta version of VESTA is available to LEADS.
A demo version of VESTA is available to all.
Be part of our story!
Senior Advisor
Vision and Imaging Team
Innovation Director and Director
Emerging Technologies and Data Science Team
Senior Advisor
Vision and Imaging Team
Director
Speech and Text Team
Senior Advisor
Vision and Imaging Team
Senior Advisor and Lead Architect
Emerging Technologies and Data Science Team
Senior Researcher
Speech and Text Team
Researcher
Vision and Imaging Team
Senior Research Agent
Emerging Technologies and Data Science Team
Advisor
Vision and Imaging Team
Advisor
Speech and Text Team
Senior Research Agent
Emerging Technologies and Data Science Team
Senior Research Agent
Emerging Technologies and Data Science Team
Research Agent
Emerging Technologies and Data Science Team
Research Agent
Emerging Technologies and Data Science Team