Author: Li? Scale?
Source: Jianshu
Editor: Nizi Gu Liang Jr.
The city hasn't changed, but life has changed.
First, the current state of voice development
Voice is a daunting area for many people who are just stepping into the field of artificial intelligence, because in many respects the development of speech technology has been tortuous. The current situation is as follows:
1. Speech technology is hard to implement, and the preprocessing pipeline is complicated.
2. Corpora are insufficient, and few people study the field.
3. Open resources of all kinds are relatively scarce, so development is slower.
We are still in the mobile internet era. People mainly solve everyday problems through mobile phones, and phones convey information through visual interfaces. People have grown used to this over the years, so suddenly switching to an interface-free voice mode, which conveys information in a simpler and slower way, creates many conflicts. It feels strange to use, and many products have flopped as a result.
Although artificial intelligence is strongly supported, the field is still young, and the supply of talent cannot meet companies' needs. Companies want mature engineers who can commercialize technology in a short time, and hiring them consumes huge financial and material resources. Engineers who are transitioning from other fields, or students in related majors, do not yet meet this standard. The gap is especially obvious in voice, so the market is restless and anxious.
Second, voice is the entrance to artificial intelligence.
Does this mean that voice will gradually fade as the field develops, or even turn out to be a bubble? I think not. On the contrary, voice is the real entrance through which people will accept artificial intelligence, and it will be the most common and widely used scenario in the future. Today's interfaces give people room to choose and diverse ways to input information, but people are increasingly burdened by them: they are inefficient, choosing is tiring, and it is easy to get hooked.
Let me take these points one at a time.
1. Why is voice the entrance?
If you ask an ordinary person whether detecting a license plate number in a picture, or a mobile app pushing news and products they like, is artificial intelligence, they may not be sure. They may not even have noticed these changes. But if you ask whether a machine that talks like a person is artificial intelligence, they will say yes without hesitation, because speech is the most intuitive and natural way to judge intelligence. People do not understand the algorithms or the data. They only know that the cold machine in front of them can communicate like a human being and serve them through that communication. To them, that is intelligence, so voice is the most suitable entrance.
2. Why will voice have the widest range of application scenarios in the future?
With continued model training, hospitals can diagnose diseases using machines that have scanned large numbers of images, police can identify suspects through face detection, and finance can predict stocks with the relevant algorithms. But the scenarios that suit voice best are those close to people's ordinary lives. Voice will also be added in fields such as education, law and production, but it is not the key to how machines really meet people's needs there. It merely changes the input and output method in those specific fields and is, in fact, dispensable. In ordinary life, however, as voice comes to cover every scenario, and after a long period of trying it, adapting and changing how they solve problems, people will gradually break free of the screen and of constraints on where they are. They will receive information anytime and anywhere in the most direct way, control everything in their lives, and eventually reach a state in which many everyday problems can be solved just by speaking.
Third, the future of voice.
Imagine further: what will it be like when voice is everywhere? It will be a society in which everything is interconnected, a phrase I have often heard recently to describe the future. So how is that interconnection achieved? Explaining it in general terms is abstract and dull, and constantly singing the praises of voice starts to sound like advertising. So instead I will walk through an ordinary day of yours to describe this future life connected by voice.
Voice leads the internet of everything.
1. The smart speaker wakes you up
One day in the future, at six in the morning, the smart speaker wakes you from your sleep. You ask in passing about today's weather. After the forecast, it reminds you that today is the winter solstice and that you should not forget to eat dumplings (jiaozi). After a brief discussion about which filling is best, you ask the speaker to order a plate of dumplings for delivery from a nearby restaurant, noting that the restaurant should deliver it in 40 minutes, because a morning run is something you do every day. You put on the matching smart headphones as you go out, wake the assistant and have it play some of the songs you usually like. As you run, you see some kittens in the park and cannot help stopping to play with them, and you lose track of time. The headphones remind you that the takeaway will arrive in 5 minutes. You start running home but are still late. The courier calls; you answer directly on the headphones and tell him to leave it at the door. Back home, you take off the headphones. After washing up, you wake the speaker and have it read the latest news, and you eat your dumplings while listening. You switch through a few channels by voice command and, finding nothing interesting, turn it off.
2. The car voice assistant will accompany you to work
After breakfast you drive to work. Once in the car, you wake the in-car voice assistant and have it navigate a route that is currently free of congestion; it is the morning rush hour, after all. Just as you are about to set off, you remember that you left a light on at home, so you ask the assistant to turn it off and to check whether anything else was left on. Once it confirms, you relax. A little later, the assistant tells you that someone has sent you a WeChat message, and you have it read the message aloud. It turns out that today is the birthday of your younger sister, who is studying back home, and you have been so busy with work that you forgot. You ask the assistant to pick out a pen at around 200 yuan. After you choose the color, the assistant sends a picture to the car's screen. It looks good, so you pay with a voice password and then have the assistant reply to your sister: "I've got a small gift for you."
3. The office speaker helps you work
When you get to work, you wake the speaker in the office and ask it how many tasks you have to finish today. Worried that you might slack off, you ask it to set a deadline for each task and remind you when the time comes. One task is an email that has to go to several people at once. You just think through the content and dictate it, and the assistant sends it to your colleagues, which saves the tedious steps of switching between interfaces. While you are hard at work, your manager suddenly tells you to go on a business trip to another city next week. You quickly ask the assistant to book a flight for the day before and a hotel next to the airport, pick a room type, confirm the price, and get back to work. When you come back from lunch, the boss calls you to a meeting through your assistant. You can discuss while you listen, which is very efficient, because the assistant in the conference room has already transcribed the minutes and sent them to your mailbox. There is no need to worry about missing or forgetting anything.
4. Voice assistant helps you fall asleep
It is time to go home. As you leave the company you put on your headphones, ask whether the pen you bought for your sister has arrived, and ask the assistant to turn on the water heater at home. Operating and waiting are reduced to a single sentence. On the drive home, living alone can feel dull, so you have got into the habit of chatting with the assistant for a while. It knows a bit of everything and helps untangle some of the knots from your workday. The chat puts you in a good mood. At home, you have it check what is in the refrigerator and recommend a few recipes you can make. You pick a simple one and follow along as you cook, while it thoughtfully plays the songs you like, which could hardly be more comfortable. Before going to bed you tell it about tomorrow's work tasks, and then you go to sleep. Tomorrow is a new day.
Fourth, future voice applications will be ordinary and widespread.
The story is unremarkable, but this unremarkable side is how most people live, and it is only in this ordinary life that voice can deliver its greatest value. (The story does not describe parcels delivered by unmanned vehicles, restaurants served by automated tables or robot waiters, corridors cleaned by sweeping robots, or automatic diagnostic devices at the office or on the street that can check your health at any time. Here we are mainly talking about voice applications.)
The story shows that even for an ordinary office worker, the voice devices serving him touch many fields, including but not limited to logistics, takeaway, e-commerce, note-taking, reminders, messaging, navigation, reading text aloud, smart home control, chatting, listening to music, answering calls, payment and news. We always say that the richness of Internet application scenarios gives artificial intelligence more room to develop, but it is precisely because the scenarios are so rich that meeting the ordinary needs of people's lives is not easy. If scenario coverage is not rich enough, the experience suffers. If the hardware environment cannot cover the scenarios, the convenience of voice cannot be fully realized. There is still a lot to do on the road ahead for voice.
The point of using voice is to make solving problems more efficient, more varied and more fun. We do not use voice for its own sake. Rather, whenever we want to get something done quickly, we can naturally solve it with a sentence. That requires machines to do the groundwork extremely well, and it requires an ecosystem rich in both software and hardware.
Fifth, problems that voice applications still need to overcome
The story raises several problems that still need to be overcome in technology and product design:
1. How to connect and control a variety of hardware (headphones, speakers, household appliances), make software interoperate and update content in real time.
2. How to support far-field voice dictation on headphones and speakers, and how to design a way to delete and modify the dictated text at any time.
3. How to book and purchase all kinds of tickets.
4. Screening products with algorithms is not a problem for computers, but how can users receive the results in the most convenient way?
5. How, in what form and at what time should the system proactively push messages or offer functions, based on the user's living habits over a period of time?
6. Can the machine do two or more things at the same time (play music and read out a recipe)?
7. In chat, the assistant cannot always fall back on a default reply. How to handle encyclopedic questions, professional questions and even emotional needs with a neutral attitude and rational analysis may not be purely a technical matter.
8. Because scenarios are so varied, the device cannot always rely on being woken up. How should it remind users of dates, schedules, new messages and other information without being woken?
9. How to pay without an interface while keeping security high?
10. How to control the amount of information and the rhythm, speed and volume of speech so that users do not reject it?
11. Spoken words are like spilled water: they cannot be taken back. Given the immediacy of voice, how do we design in time for users to confirm? With a prompt? Or by repeating the command back?
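As a rough illustration of the last problem, the choice between a prompt and a read-back could depend on how costly a mistake would be. The following is only a minimal sketch under assumptions of my own: the names `VoiceAction`, `Confirmation` and `choose_confirmation`, and the two risk flags, are hypothetical and do not come from any real assistant platform.

```python
from dataclasses import dataclass
from enum import Enum


class Confirmation(Enum):
    """Hypothetical confirmation styles for a spoken command."""
    NONE = "execute immediately"
    CANCEL_WINDOW = "announce the action and allow a few seconds to say 'cancel'"
    READ_BACK = "repeat the command back and wait for an explicit yes"


@dataclass(frozen=True)
class VoiceAction:
    name: str
    reversible: bool   # assumption: can the effect be undone afterwards?
    costs_money: bool  # assumption: does the action spend the user's money?


def choose_confirmation(action: VoiceAction) -> Confirmation:
    """Pick a confirmation style from the assumed risk flags."""
    if action.costs_money:
        # Payments get the strictest check: a full read-back.
        return Confirmation.READ_BACK
    if not action.reversible:
        # Irreversible but free actions (e.g. sending a message) get a short hint.
        return Confirmation.CANCEL_WINDOW
    # Harmless, reversible actions run without interrupting the user.
    return Confirmation.NONE


if __name__ == "__main__":
    for action in (
        VoiceAction("play_music", reversible=True, costs_money=False),
        VoiceAction("send_message", reversible=False, costs_money=False),
        VoiceAction("book_flight", reversible=False, costs_money=True),
    ):
        print(action.name, "->", choose_confirmation(action).value)
```

In the story, playing songs would run at once, the dictated email would get a brief cancel window, and the flight booking would be read back before payment. Where to draw these lines is a product decision, not something the code settles.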
Sixth, conclusion
Because the audience for voice is every ordinary person, product experience matters more here than in other directions. Technical accuracy, by contrast, only needs to be good enough to satisfy people, so the demands on interaction designers and product managers are higher.
With these problems in mind, I will next summarize and analyze how today's AI companies are positioned in applications, chips and open platforms, to see which problems have been solved, which still have a long way to go, which are more about technology and which are more about design. This should help companies find an accurate positioning, identify competing products, choose a development direction and find ideas for solving these problems.