smart speakers
   HOME

TheInfoList



OR:

A smart speaker is a type of
loudspeaker A loudspeaker (commonly referred to as a speaker or speaker driver) is an electroacoustic transducer that converts an electrical audio signal into a corresponding sound. A ''speaker system'', also often simply referred to as a "speaker" or ...
and voice command device with an integrated
virtual assistant An intelligent virtual assistant (IVA) or intelligent personal assistant (IPA) is a software agent that can perform tasks or services for an individual based on commands or questions. The term " chatbot" is sometimes used to refer to virtua ...
that offers interactive actions and hands-free activation with the help of one "hot word" (or several "hot words"). Some smart speakers can also act as a
smart device A smart device is an electronic device, generally connected to other devices or networks via different wireless protocols (such as Bluetooth, Zigbee, near-field communication, Wi-Fi, LiFi, or 5G) that can operate to some extent interactively a ...
that utilizes
Wi-Fi Wi-Fi () is a family of wireless network protocols, based on the IEEE 802.11 family of standards, which are commonly used for local area networking of devices and Internet access, allowing nearby digital devices to exchange data by radio waves ...
,
Bluetooth Bluetooth is a short-range wireless technology standard that is used for exchanging data between fixed and mobile devices over short distances and building personal area networks (PANs). In the most widely used mode, transmission power is limi ...
and other protocol standards to extend usage beyond audio playback, such as to control
home automation Home automation or domotics is building automation for a home, called a smart home or smart house. A home automation system will monitor and/or control home attributes such as lighting, climate, entertainment systems, and appliances. It m ...
devices. This can include, but is not limited to, features such as compatibility across a number of services and platforms,
peer-to-peer Peer-to-peer (P2P) computing or networking is a distributed application architecture that partitions tasks or workloads between peers. Peers are equally privileged, equipotent participants in the network. They are said to form a peer-to-peer ...
connection through
mesh networking A mesh network is a local area network topology in which the infrastructure nodes (i.e. bridges, switches, and other infrastructure devices) connect directly, dynamically and non-hierarchically to as many other nodes as possible and cooperate wit ...
, virtual assistants, and others. Each can have its own designated interface and
features Feature may refer to: Computing * Feature (CAD), could be a hole, pocket, or notch * Feature (computer vision), could be an edge, corner or blob * Feature (software design) is an intentional distinguishing characteristic of a software ite ...
in-house, usually launched or controlled via
application Application may refer to: Mathematics and computing * Application software, computer software designed to help the user to perform specific tasks ** Application layer, an abstraction layer that specifies protocols and interface methods used in a c ...
or
home automation Home automation or domotics is building automation for a home, called a smart home or smart house. A home automation system will monitor and/or control home attributes such as lighting, climate, entertainment systems, and appliances. It m ...
software. Some smart speakers also include a screen to show the user a visual response. As of winter 2017, it is estimated by NPR and Edison Research that 39 million Americans (16% of the population over 18) own a smart speaker. A smart speaker with a touchscreen is known as a smart display. It is a smart Bluetooth device that integrates
conversational user interface A conversational user interface (CUI) is a user interface for computers that emulates a conversation with a real human. Historically, computers have relied on text-based user interfaces and graphical user interfaces (GUIs) (such as the user press ...
with display screens to augment voice interaction with images and video. They are powered by one of the common
voice assistants An intelligent virtual assistant (IVA) or intelligent personal assistant (IPA) is a software agent that can perform tasks or services for an individual based on commands or questions. The term "chatbot" is sometimes used to refer to virtual ...
and offer controls for smart home devices, feature streaming apps and web browsers with touch controls for selecting content. The first smart displays were introduced in 2017 by
Amazon Amazon most often refers to: * Amazons, a tribe of female warriors in Greek mythology * Amazon rainforest, a rainforest covering most of the Amazon basin * Amazon River, in South America * Amazon (company), an American multinational technolog ...
(
Amazon Echo Show Amazon Echo Show is a smart speaker that is part of the Amazon Echo line of products. Similarly to other devices in the family, it is designed around Amazon's virtual assistant Alexa, but additionally features a touchscreen display that can be ...
, Amazon Echo Spot).


Accuracy

According to a study by ''
Proceedings of the National Academy of Sciences of the United States of America ''Proceedings of the National Academy of Sciences of the United States of America'' (often abbreviated ''PNAS'' or ''PNAS USA'') is a peer-reviewed multidisciplinary scientific journal. It is the official journal of the National Academy of Sc ...
'' released In March 2020, the six biggest tech development companies,
Amazon Amazon most often refers to: * Amazons, a tribe of female warriors in Greek mythology * Amazon rainforest, a rainforest covering most of the Amazon basin * Amazon River, in South America * Amazon (company), an American multinational technolog ...
,
Apple An apple is an edible fruit produced by an apple tree (''Malus domestica''). Apple trees are cultivated worldwide and are the most widely grown species in the genus '' Malus''. The tree originated in Central Asia, where its wild ancest ...
,
Google Google LLC () is an American Multinational corporation, multinational technology company focusing on Search Engine, search engine technology, online advertising, cloud computing, software, computer software, quantum computing, e-commerce, ar ...
, Yandex, IBM and
Microsoft Microsoft Corporation is an American multinational technology corporation producing computer software, consumer electronics, personal computers, and related services headquartered at the Microsoft Redmond campus located in Redmond, Washi ...
, have misidentified more words spoken by "
black people Black is a racialized classification of people, usually a political and skin color-based category for specific populations with a mid to dark brown complexion. Not all people considered "black" have dark skin; in certain countries, often in ...
" than "
white people White is a racialized classification of people and a skin color specifier, generally used for people of European origin, although the definition can vary depending on context, nationality, and point of view. Description of populations as ...
". The systems tested errors and unreadability, with a 19 and 35 percent discrepancy for the former and 2 and 20 percent discrepancy for the latter. The North American Chapter of the Association for Computational Linguistics (NAACL) also identified a discrepancy between male and female voices. According to their research, Google's speech recognition software is 13 percent more accurate for men than women. It performs better than the systems used by Bing,
AT&T AT&T Inc. is an American multinational telecommunications holding company headquartered at Whitacre Tower in Downtown Dallas, Texas. It is the world's largest telecommunications company by revenue and the third largest provider of mobile ...
and IBM.


Privacy concerns

The built-in microphone in smart speakers is continuously listening for "hot words" followed by a command. However, these continuously listening microphones also raise privacy concerns among users. These include what is being recorded, how the data will be used, how it will be protected, and whether it will be used for invasive advertising. Furthermore, an analysis of Amazon Echo Dots showed that 30–38% of "spurious audio recordings were human conversations", suggesting that these devices capture audio other than strictly detection of the hot word.


As a wiretap

There are strong concerns that the ever-listening microphone of smart speakers presents a perfect candidate for
wiretapping Telephone tapping (also wire tapping or wiretapping in American English) is the monitoring of telephone and Internet-based conversations by a third party, often by covert means. The wire tap received its name because, historically, the monitorin ...
. In 2017, British security researcher Mark Barnes showed that pre-2017 Echos have exposed pins which allow for a compromised OS to be booted.


Voice assistance vs privacy

While
voice assistants An intelligent virtual assistant (IVA) or intelligent personal assistant (IPA) is a software agent that can perform tasks or services for an individual based on commands or questions. The term "chatbot" is sometimes used to refer to virtual ...
provide a valuable service, there can be some hesitation towards using them in various social contexts, such as in public or around other users. However, only more recently have users begun interacting with voice assistants through an interaction with smart speakers rather than an interaction with the phone. On the phone, most voice assistants have the option to be engaged by a physical button (e.g., Siri with a long press of the home button) rather than solely by hot word-based engagement in a smart speaker. While this distinction increases the privacy by limiting when the microphone is on, users felt that having to press a button first removed the convenience of voice interaction. This trade-off is not unique to voice assistants; as more and more devices come online, there is an increasing trade-off between convenience and privacy.


Factors influencing adoption

While there are many factors influencing smart speaker adoption, specifically with regards to privacy, Lau et al. define five distinct categories as pros and cons: convenience, identity as an early adopter, contributing factors, perceived lack of utility, privacy and security concerns.


Security concerns

When configured without
authentication Authentication (from ''authentikos'', "real, genuine", from αὐθέντης ''authentes'', "author") is the act of proving an assertion, such as the identity of a computer system user. In contrast with identification, the act of indicatin ...
, smart speakers can be activated by people other than the intended user or owner. For example, visitors to a home or office, or people in a publicly accessible area outside an open window, partial wall, or security fence, may be able to be heard by a speaker. One team demonstrated the ability to stimulate the microphones of smart speakers and smartphones through a closed window, from another building across the street, using a laser.


Most popular smart speaker devices and platforms


Gallery


See also

*
Smart home hub A smart home hub, sometimes also referred to as a "smart hub", "gateway'", "bridge", "controller" or "coordinator", is a control center/centre for a smart home, and enables the components of a smart home to communicate and respond to each other via ...
* Thread (network protocol) * Matter (standard)


References

{{Smart speaker Internet of things Internet radio Wireless Applications of artificial intelligence