Paxata
   HOME

TheInfoList



OR:

Paxata is a privately owned software company headquartered in
Redwood City, California Redwood City is a city on the San Francisco Peninsula in Northern California's Bay Area, approximately south of San Francisco, and northwest of San Jose. Redwood City's history spans its earliest inhabitation by the Ohlone people to being a ...
. It develops self-service data preparation software that gets data ready for
data analytics Analytics is the systematic computational analysis of data or statistics. It is used for the discovery, interpretation, and communication of meaningful patterns in data. It also entails applying data patterns toward effective decision-making. It ...
software. Paxata's software is intended for
business analyst A business analyst (BA) is a person who processes, interprets and documents business processes, products, services and software through analysis of data. The role of a business analyst is to ensure business efficiency increases through their know ...
s, as opposed to technical staff. It is used to combine data from different sources, then check it for
data quality Data quality refers to the state of qualitative or quantitative pieces of information. There are many definitions of data quality, but data is generally considered high quality if it is "fit for tsintended uses in operations, decision making and ...
issues, such as duplicates and outliers. Algorithms and
machine learning Machine learning (ML) is a field of inquiry devoted to understanding and building methods that 'learn', that is, methods that leverage data to improve performance on some set of tasks. It is seen as a part of artificial intelligence. Machine ...
automate certain aspects of data preparation and users work with the software through a user-interface similar to
Excel ExCeL London (an abbreviation for Exhibition Centre London) is an exhibition centre, international convention centre and former hospital in the Custom House area of Newham, East London. It is situated on a site on the northern quay of the ...
spreadsheets. The company was founded in January 2012 and operated in
stealth mode In business, stealth mode is a company's temporary state of secretiveness, usually undertaken to avoid alerting competitors to a pending product launch or another business initiative. When an entire company is in stealth mode it may attempt to ...
until October 2013. It received more than $10 million in venture funding before being acquired by DataRobot.


History

Paxata was founded in January 2012. It initially raised $2 million in venture capital. The company came out of
stealth mode In business, stealth mode is a company's temporary state of secretiveness, usually undertaken to avoid alerting competitors to a pending product launch or another business initiative. When an entire company is in stealth mode it may attempt to ...
in October 2013. Simultaneously with its public release, Paxata announced an $8 million funding round led by Accel Partners. Adoption of the software grew quickly. In March 2014,
In-Q-Tel In-Q-Tel (IQT), formerly Peleus and In-Q-It, is an American not-for-profit venture capital firm based in Arlington, Virginia. It invests in high-tech companies to keep the Central Intelligence Agency, and other intelligence agencies, equipped with ...
acquired an interest in the startup. It raised an additional $18 million in funding in September 2015. It also began working with Cisco to jointly develop the Cisco Data Preparation suite of software and services.


Software

Paxata refers to its suite of cloud-based data
quality Quality may refer to: Concepts *Quality (business), the ''non-inferiority'' or ''superiority'' of something *Quality (philosophy), an attribute or a property *Quality (physics), in response theory *Energy quality, used in various science discipli ...
,
integration Integration may refer to: Biology *Multisensory integration *Path integration * Pre-integration complex, viral genetic material used to insert a viral genome into a host genome *DNA integration, by means of site-specific recombinase technology, ...
, enrichment and governance products as "Adaptive Data Preparation." The software is intended for
business analyst A business analyst (BA) is a person who processes, interprets and documents business processes, products, services and software through analysis of data. The role of a business analyst is to ensure business efficiency increases through their know ...
s, who need to combine data from a variety of sources, then check the data for duplicates, empty fields, outliers, trends and integrity issues before conducting analysis or visualization in a third-party software tool. It uses algorithms and machine-learning to automate certain aspects of data preparation. For example, it may automatically detect records belonging to the same person or address, even if the information is formatted differently in each record in different data sets. The software has a spreadsheet-based user interface. Patterns and anomalies in the data are color-coded in the spreadsheet. Then users are provided with instructions on how to resolve data quality issues or to supplement the data with contextual information. Data sets and related quality issues can also be addressed in a collaborative environment through the "Paxata Share" feature. It runs on
Apache Spark Apache Spark is an open-source unified analytics engine for large-scale data processing. Spark provides an interface for programming clusters with implicit data parallelism and fault tolerance. Originally developed at the University of Californi ...
. According to analyst firm
Ovum The egg cell, or ovum (plural ova), is the female reproductive cell, or gamete, in most anisogamous organisms (organisms that reproduce sexually with a larger, female gamete and a smaller, male one). The term is used when the female gamete is ...
, the software is made possible through advances in
predictive analytics Predictive analytics encompasses a variety of statistical techniques from data mining, predictive modeling, and machine learning that analyze current and historical facts to make predictions about future or otherwise unknown events. In business ...
,
machine learning Machine learning (ML) is a field of inquiry devoted to understanding and building methods that 'learn', that is, methods that leverage data to improve performance on some set of tasks. It is seen as a part of artificial intelligence. Machine ...
and the
NoSQL A NoSQL (originally referring to "non- SQL" or "non-relational") database provides a mechanism for storage and retrieval of data that is modeled in means other than the tabular relations used in relational databases. Such databases have existed ...
data caching methodology. The software uses
semantic Semantics (from grc, σημαντικός ''sēmantikós'', "significant") is the study of reference, meaning, or truth. The term can be used to refer to subfields of several distinct disciplines, including philosophy, linguistics and comput ...
algorithms to understand the meaning of a data table's columns and pattern recognition algorithms to find potential duplicates in a data-set. It also uses indexing, text pattern recognition and other technologies traditionally found in social media and search software. One of the software's users is dairy producer
Danone Danone S.A. () is a French multinational corporation, multinational food-products corporation based in Paris. It was founded in Barcelona, Spain. It is listed on Euronext Paris where it is a component of the CAC 40 stock market index. Some of t ...
, which uses the software so that business staff can create their own reports on merchandising, supply chain and product data, without the IT department.


Reception

In its 2014 report "Cool Vendors in Data Integration and Data Quality",
Gartner Gartner, Inc is a technological research and consulting firm based in Stamford, Connecticut that conducts research on technology and shares this research both through private consulting as well as executive programs and conferences. Its clients ...
praised Paxata for developing a "business-user-friendly" data quality product that does not use code. Ventana Research said its spreadsheet-based user interface "should resonate well with business analysts," who are resistant to move away from familiar Excel-like programs. Gartner also said Paxata was recognized in the report due to its automated, algorithm-based features and how it tracks any changes made to the data. Ventana Research said Paxata was in a "noisy marketplace". According to Gartner, while Paxata is an early entrant into the market, many startups and large corporations are making investments in developing similar competing products. According to ''
Gigaom Gigaom is a technology focused analyst firm and media company. The company evolved from a blog which offered news, analysis, and opinions on startup companies, emerging technologies, and other technology related topics. It was started by Om Malik ...
'' and ''IT Business Edge'', one way Paxata differs is that it automatically merges multiple data-sets into a single table, so it can be easily imported into a visualization or analysis tool. Gartner said Paxata will have a difficult time finding a compelling pricing model, when many data discovery tools that it supplements provide some similar features. In contrast, Ventana said Paxata's pricing was "a pretty small amount" compared to the amount of time users can save.


References


External links

* {{Good article Companies based in Redwood City, California Software companies based in California American companies established in 2012 Privately held companies based in California Software companies of the United States 2012 establishments in California 2012 establishments in the United States Software companies established in 2012 Companies established in 2012