Legacy | Audio to text

Command: audio
Subcommand: text
Supported Platforms: Native (legacy), UFT (legacy), Selenium, Appium

All commands must be executed using the same framework. Sharing the execution ID between different frameworks is not possible. This means that if you work with a Selenium/Appium driver, you need to call the commands using the Selenium/Appium driver (or, in the case of Reporting commands, through the Reporting SDK that works with the driver) and not as part of a UFT test or another framework.
Supported OS: Android, iOS

Converts audio to text by applying powerful neural network models. It accepts an audio file that may be recorded from a device and creates a text file that contains the textual translation of the audio file.

Important: This functionality is part of the Advanced Audio Package, a separate license that provides access to tools for testing a device's audio quality. For more information, contact Perfecto Support.

Description

The command accepts an audio file that may be recorded from a device and creates a text file that contains the textual translation of the audio file.

Parameters

Name	Value	Default	Description
deviceAudio			Identifies the recorded audio file.
key			Indicates the repository key of an audio file.
variable			Variable for retrieving the resultant text in Native Automation.
language	us-english (US English) uk-english (UK English) es-spanish (Spanish) japanese (Japanese) french (French) chinese (Chinese) portuguese (Portuguese) arabic (Arabic)	US English	Audio file language. Supported languages include US English, UK English, Spanish, Japanese, French, Chinese, Portuguese, Arabic (us-english,uk-english,es-spanish,japanese,french,chinese,portuguese,arabic)
rate	broad (Broad) narrow (Narrow)	Broad	Indicates the sampling rate of the audio recording. Possible values are narrow or broad. Default value is broad.
profile	performance (Performance) accuracy (Accuracy)	Performance	Selection of the speech-to-text infrastructure used for the conversion. Possible values: Performance (default) - uses a speedier but less accurate library accuracy - uses a more robust but slower conversion library.
phrase			Provides a list of phrases for speech-to-text library to use to avoid confusion. For example, provide the words:‘two’ and ‘four’ to avoid confusion with ‘to’ and ‘for’.

^* Mandatory parameter

Request & Response

Copy

Request

 https://mycloud.perfectomobile.com/services/executions/johnp@perfectomobile.com_controller_17-12-06_14_49_15_3616?operation=command&securityToken=<your_token>&command=audio&subcommand=text&param.repositoryFile=PUBLIC:SampleAudio_0.7mb.mp3

Copy

JSON response

{
    reason: "Success",
    timer.elapsed: "3324",
    returnValue: "my team won today and passed to the third phase",
    description: "Success",
    timer.system: "3324",
    sessionId: "johnp@perfectomobile.com_controller_17-12-06_14_49_15_3616",
    completed: "true",
    flowEndCode: "SUCCEEDED",
    executionId: "johnp@perfectomobile.com_controller_17-12-06_14_49_15_3616",
    timer.ux: "0",
    timer.device: "0"}