A simple request, a short sentence, for the human brain to interpret what is meant, make the connection and initiate an appropriate reaction is easy. For a machine this is much more complicated. To control technical devices with speech requires many individual steps.

Talking to Machines - New Operating Concepts With Artificial Intelligence
Talking to Machines - New Operating Concepts With Artificial Intelligence

Article from | FESTO

 

Detecting and interpreting speech

"Give me a pen!" – this may be a very simple command, but it makes the computer work hard in the background. Firstly, the spoken sentence is turned into text. The speech recognition software must overcome many challenges in order to identify the words used by their frequency patterns: unclear pronunciation, similar-sounding words with different meanings and different intonations or dialects. By comparing them with extensive databases, in which countless examples of words and their frequency patterns are stored, the software works out what the words are.

The next step is working out the meaning of the sentence. To do this, the software sends the text to a language interface that checks it for certain keywords. Beforehand, the programmer must determine all the necessary terms and commands – called intents – as well as their synonyms, and define which action lies behind each of them. For example, ‘give’ is identified as the request to transport an object to a particular place, whilst the word ‘me’ is understood to be a person or an objective of the action.

 

Artificial intelligence finds the optimal solution

Once the interface has identified the meaning of the sentence, it supplies a context object, which is a software code with which the device control system can work. In order to give the machine a clear instruction, the artificial intelligence now gets to work using other software. This evaluates the content of the context object and at the same time gets information from various sensors about the position of the device and its surroundings. The software houses modules for different solutions which are assigned to certain actions.

The program uses all this information to construct a command, for example how and where a gripper arm should move and sends it to the device controller. The sensor technology thus detects where the pen is on the desk and what path the machine must take to pick it up and hand it to a person. The software gradually learns which solution is the best for each action and applies this knowledge to the next action.

All these complex sequences must be made in fractions of a second, because the person expects a prompt and above all correct reaction from the machine. Although after 30 years of application, voice recognition works relatively well, there is still plenty of research and development going on behind the voice control of machines – until at some point we will be able to talk as naturally with a machine as with our neighbour.

 

 

The content & opinions in this article are the author’s and do not necessarily represent the views of RoboticsTomorrow

Comments (0)

This post does not have any comments. Be the first to leave a comment below.


Post A Comment

You must be logged in before you can post a comment. Login now.

Featured Product

Knight Optical - Unlocking Precision for the Robotics and Automation Revolution

Knight Optical - Unlocking Precision for the Robotics and Automation Revolution

In the era of Industry 4.0, automation and machine learning drive the future. Knight Optical stands at the forefront, collaborating globally to supply precision optical components that power the robotics and automation sector. From UAVs, drones, and robotics to machine vision, LiDAR-driven car sensors, and renewable energy, our optics empower innovation. Optical components play an important role within the industry, including: Optical windows and domes safeguard systems in unmanned vehicles while preserving the field of view for cameras. Infrared lenses with aspheric surfaces elevate thermal imaging. Optical filters excel in machine vision, with colour glass, interference, and dichroic filters in our stock. Knight Optical provides stock components and custom solutions made to your exact specs. With every component undergoing rigorous metrology and QA checks before shipment, you are sure to experience true precision, innovation, and assurance.