azure_stt

API Wrapper for the Microsoft Azure Speech Services Speech-to-text REST API 3.0 (Cognitive Services).

Installation

Add this line to your application’s Gemfile:

ruby gem 'azure_stt'

And then execute:

bash bundle

Or install it yourself as:

bash gem install azure_stt

Azure Speech-to-text Subscription key

To be able to use the gem, you must have a subscription key. You can generate one on your Azure account.

  • If you don’t have an Azure account, you can create one for free on the Azure website.
  • Once logged in to your Azure portal, subscribe to Speech Services in Microsoft Cognitive Services.
  • You will find two subscription keys available in ‘RESOURCE MANAGEMENT > Keys’ (‘KEY 1’ and ‘KEY 2’).

Usage

Configuration

Two environment variables are used:

  • ‘REGION’: the region of your subscription

  • ‘SUBSCRIPTION_KEY’: the API key you can generate on your Azure account.
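
If you want to fail fast when one of these variables is missing, a small check at boot time can help. The sketch below is not part of the gem: `azure_stt_settings` is a hypothetical helper name, and it only relies on the two variable names listed above.

```ruby
# Hypothetical helper (not provided by azure_stt): reads REGION and
# SUBSCRIPTION_KEY and raises a clear error if one is missing or blank.
# `env` defaults to ENV but accepts any Hash-like object, which makes
# the helper easy to test.
def azure_stt_settings(env = ENV)
  %w[REGION SUBSCRIPTION_KEY].each_with_object({}) do |name, settings|
    value = env[name].to_s.strip
    raise KeyError, "Missing environment variable #{name}" if value.empty?

    # 'REGION' becomes :region, 'SUBSCRIPTION_KEY' becomes :subscription_key
    settings[name.downcase.to_sym] = value
  end
end
```

The returned hash uses the keys `region:` and `subscription_key:`, so it could be passed on as `AzureSTT::Session.new(**azure_stt_settings)`.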

You can look at the file env.sample and change the values. If you do not want to use environment variables, you can configure the values like so:

ruby AzureSTT.configure do |config| config.region = 'your_region'; config.subscription_key = 'your_key' end

Finally, the class AzureSTT::Session uses the values from the configuration by default, but you can initialize the session with custom values:

ruby session = AzureSTT::Session.new(region: 'your_region', subscription_key: 'your_key')

Start a transcription

```ruby
require 'azure_stt'

properties = {
  'diarizationEnabled' => false,
  'wordLevelTimestampsEnabled' => false,
  'punctuationMode' => 'DictatedAndAutomatic',
  'profanityFilterMode' => 'Masked'
}

content_urls = [
  'https://path.com/audio.ogg',
  'https://path.com/audio1.ogg'
]

session = AzureSTT::Session.new

transcription = session.create_transcription(
  content_urls: content_urls,
  properties: properties,
  locale: 'en-US',
  display_name: 'The name of the transcription'
)

# You can then retrieve the results of your transcription with the id
puts transcription.id # Outputs 'your_transcription_id'
```
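
For orientation, here is a rough sketch of the kind of HTTP request the Speech-to-text REST API 3.0 expects when a transcription is created. It is an illustration only and not the gem's actual implementation, which may differ. `build_transcription_request` is a hypothetical name, and the mapping of the keyword arguments to the JSON fields `contentUrls`, `properties`, `locale` and `displayName` is an assumption based on the public REST API. The request is built but not sent.

```ruby
require 'json'
require 'net/http'
require 'uri'

# Hypothetical helper (not provided by azure_stt): builds, without
# sending, a POST request for the v3.0 transcriptions endpoint.
def build_transcription_request(region:, subscription_key:, content_urls:,
                                locale:, display_name:, properties: {})
  uri = URI("https://#{region}.api.cognitive.microsoft.com/speechtotext/v3.0/transcriptions")

  request = Net::HTTP::Post.new(uri)
  # The subscription key is sent in this header.
  request['Ocp-Apim-Subscription-Key'] = subscription_key
  request['Content-Type'] = 'application/json'
  request.body = JSON.generate(
    'contentUrls' => content_urls,
    'properties' => properties,
    'locale' => locale,
    'displayName' => display_name
  )
  request
end
```

Sending it would then be a matter of `Net::HTTP.start(request.uri.host, request.uri.port, use_ssl: true) { |http| http.request(request) }`. In normal use, `session.create_transcription` spares you this step.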

Get a transcription

```ruby
require 'azure_stt'

session = AzureSTT::Session.new

transcription = session.get_transcription('your_transcription_id')

# Returns
# #<AzureSTT::Transcription id="d35a802d-70ae-4358-a35d-b5faa0c75457"
#   model="" properties={
#     "wordLevelTimestampsEnabled"=>false, "channels"=>[0, 1],
#     "punctuationMode"=>"DictatedAndAutomatic", "profanityFilterMode"=>"Masked",
#     "duration"=>"PT5M18S"}
#   links={"files"=>"https://uscentral.api.cognitive.microsoft.com/speechtotext/v3.0/transcriptions/d35a802d-70ae-4358-a35d-b5faa0c75457/files"}
#   last_action_date_time=#<Date: 2020-05-31 ((2459366j,0s,0n),+0s,2299161j)> created_date_time=#<Date: 2020-05-31 ((2459366j,0s,0n),+0s,2299161j)>
#   status="Succeeded" locale="en-US" display_name="Transcription name" files=[]>

if transcription.succeeded?
  # You can then access the text, for instance:
  result = transcription.results.first
  puts result.text
end
```
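
A transcription may not have succeeded yet when you first fetch it, so you may want to poll until it has. The helper below is a minimal sketch and not part of the gem: `wait_for_transcription` is a hypothetical name, and it assumes that `status` is readable on the transcription and that the service reports a failed job as "Failed". It only uses `get_transcription` and `succeeded?` from the examples above.

```ruby
# Hypothetical helper (not provided by azure_stt): polls a transcription
# until it succeeds, fails, or the timeout (in seconds) elapses.
def wait_for_transcription(session, id, interval: 5, timeout: 600)
  deadline = Process.clock_gettime(Process::CLOCK_MONOTONIC) + timeout

  loop do
    transcription = session.get_transcription(id)
    return transcription if transcription.succeeded?

    # Assumption: a failed job is reported with the status "Failed".
    raise "Transcription #{id} failed" if transcription.status == 'Failed'

    if Process.clock_gettime(Process::CLOCK_MONOTONIC) >= deadline
      raise "Timed out waiting for transcription #{id}"
    end

    sleep interval
  end
end
```

You could then call `wait_for_transcription(session, transcription.id)` and read `results` from the returned object.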

Development

After checking out the repo, run bin/setup to install dependencies. You can also run bin/console for an interactive prompt that will allow you to experiment.

To install this gem onto your local machine, run bundle exec rake install. To release a new version, update the version number in version.rb, and then run bundle exec rake release, which will create a git tag for the version, push git commits and tags, and push the .gem file to rubygems.org.

Contributing

Bug reports and pull requests are welcome on GitHub at https://github.com/[USERNAME]/azure_stt. This project is intended to be a safe, welcoming space for collaboration, and contributors are expected to adhere to the Contributor Covenant code of conduct.

Code of Conduct

Everyone interacting in the AzureStt project’s codebases, issue trackers, chat rooms and mailing lists is expected to follow the code of conduct.