Session (web analytics)

In web analytics, a session, or visit, is a unit of measurement of a user's actions taken within a period of time or with regard to completion of a task. Sessions are also used in operational analytics and provision of user-specific recommendations. There are two primary methods used to define a session: time-oriented approaches based on continuity in user activity and navigation-based approaches based on continuity in a chain of requested pages.


Definition

The definition of "session" varies, particularly when applied to search engines. Generally, a session is understood to consist of "a sequence of requests made by a single end-user during a visit to a particular site". In the context of search engines, "sessions" and "query sessions" have at least two definitions. A session or query session may be all queries made by a user in a particular time period, or it may be a series of queries or navigations with a consistent underlying user need.


Uses

Sessions per user can be used as a measurement of website usage. Other metrics used within research and applied web analytics include session length and user actions per session. Session length is seen as a more accurate alternative to measuring page views. Reconstructed sessions have also been used to measure total user input, including to measure the number of labour hours taken to construct Wikipedia. Sessions are also used for operational analytics, data anonymization, identifying networking anomalies, and synthetic workload generation for testing servers with artificial traffic.
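
As a rough illustration of the metrics named above, the following Python sketch computes sessions per user, actions per session, and mean session length from sessions that have already been reconstructed. The data, the variable names, and the choice to measure session length as the time between a session's first and last request are all assumptions made for this example; they are not taken from any particular analytics tool.

```python
from datetime import datetime
from statistics import mean

# Hypothetical reconstructed sessions: user id -> list of sessions,
# where each session is a list of request timestamps in order.
sessions_by_user = {
    "alice": [
        [datetime(2024, 1, 1, 9, 0),
         datetime(2024, 1, 1, 9, 10),
         datetime(2024, 1, 1, 9, 25)],
        [datetime(2024, 1, 2, 14, 0),
         datetime(2024, 1, 2, 14, 5)],
    ],
    "bob": [
        [datetime(2024, 1, 1, 20, 0)],
    ],
}

# Flatten to a single list of sessions across all users.
all_sessions = [s for user_sessions in sessions_by_user.values()
                for s in user_sessions]

# Average number of sessions per user.
sessions_per_user = len(all_sessions) / len(sessions_by_user)

# Average number of requests (user actions) per session.
actions_per_session = mean(len(s) for s in all_sessions)

# Assumption: session length is the time from first to last request,
# so a single-request session has length zero.
mean_session_length = mean(
    (s[-1] - s[0]).total_seconds() for s in all_sessions
)
```

Under this definition a session with a single request contributes a length of zero, because the time spent on the final page is not observable from request timestamps alone.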


Session reconstruction

Essential to the use of sessions in web analytics is being able to identify them. This is known as "session reconstruction". Approaches to session reconstruction can be divided into two main categories: time-oriented, and navigation-oriented.


Time-oriented approaches

Time-oriented approaches to session reconstruction look for a set period of user inactivity, commonly called an "inactivity threshold". Once this period of inactivity is reached, the user is assumed to have left the site or stopped using the browser entirely, and the session is ended. Further requests from the same user are considered a second session. A common value for the inactivity threshold is 30 minutes, which is sometimes described as the industry standard. Some have argued that a threshold of 30 minutes produces artifacts around naturally long sessions and have experimented with other thresholds. Others simply state: "no time threshold is effective at identifying sessions". One alternative that has been proposed is using user-specific thresholds rather than a single, global threshold for the entire dataset. This has the problem of assuming that the thresholds follow a bimodal distribution, and is not suitable for datasets that cover a long period of time.
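
A minimal Python sketch of the global-threshold approach is given below. It assumes the requests have already been attributed to a single user, and it treats a gap equal to the threshold as ending the session; the function name `split_sessions` and the sample timestamps are hypothetical.

```python
from datetime import datetime, timedelta


def split_sessions(timestamps, threshold=timedelta(minutes=30)):
    """Group one user's request timestamps into sessions.

    A request continues the current session only if it arrives less
    than `threshold` after the previous request; otherwise it starts
    a new session.
    """
    sessions = []
    for ts in sorted(timestamps):
        if sessions and ts - sessions[-1][-1] < threshold:
            sessions[-1].append(ts)
        else:
            sessions.append([ts])
    return sessions


# Hypothetical request times for a single user.
requests = [
    datetime(2024, 1, 1, 9, 0),
    datetime(2024, 1, 1, 9, 10),
    datetime(2024, 1, 1, 9, 35),
    datetime(2024, 1, 1, 11, 0),
]
sessions = split_sessions(requests)
```

Here the first three requests are separated by gaps of 10 and 25 minutes and so fall into one session, while the request at 11:00 follows an 85-minute gap and opens a second session. A user-specific variant would pass a different `threshold` for each user instead of the shared default.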


Navigation-oriented approaches

Navigation-oriented approaches exploit the structure of websites: specifically, the presence of hyperlinks and the tendency of users to navigate between pages on the same website by clicking on links, rather than typing the full URL into their browser. One way of identifying sessions by looking at this data is to build a map of the website: if the user's first page can be identified, the "session" of actions lasts until they land on a page which cannot be accessed from any of the previously accessed pages. This takes into account backtracking, where a user will retrace their steps before opening a new page. A simpler approach, which does not take backtracking into account, is to require that the HTTP referer of each request be a page that is already in the session. If it is not, a new session is created. This class of heuristics "exhibits very poor performance" on websites that contain framesets.
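
Both heuristics can be sketched in a few lines of Python. The sketch below is an illustration under stated assumptions rather than a reference implementation: requests are assumed to be ordered and already attributed to one user, the site map is assumed to be available as a dictionary from each page to the set of pages it links to, and the function names, paths, and sample data are hypothetical.

```python
def sessions_by_link_graph(pages, links):
    """Split an ordered page sequence using a site map.

    `links` maps each page to the set of pages it links to. A page
    joins the current session if any page already in that session
    links to it; otherwise it starts a new session.
    """
    sessions = []
    for page in pages:
        current = sessions[-1] if sessions else []
        if any(page in links.get(prev, ()) for prev in current):
            current.append(page)
        else:
            sessions.append([page])
    return sessions


def sessions_by_referer(requests):
    """Split ordered (url, referer) pairs into sessions.

    A request joins the current session only if its referer is a page
    already in that session; otherwise it starts a new session.
    """
    sessions = []
    for url, referer in requests:
        if sessions and referer in sessions[-1]:
            sessions[-1].append(url)
        else:
            sessions.append([url])
    return sessions


# Hypothetical site map and request log for one user.
links = {"/": {"/a", "/b"}, "/a": {"/a1"}, "/b": set()}
pages = ["/", "/a", "/a1", "/b", "/x"]
graph_sessions = sessions_by_link_graph(pages, links)

log = [
    ("/", None),
    ("/a", "/"),
    ("/a1", "/a"),
    ("/b", "/"),
    ("/x", "https://other.example/"),
]
referer_sessions = sessions_by_referer(log)
```

In the link-graph example, `/b` stays in the first session even though it is not linked from `/a1`, because it is reachable from the earlier page `/`. The page `/x` is not linked from any visited page, and its referer is not in the session, so both functions place it in a new session.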

