=Paper=
{{Paper
|id=Vol-1618/INRA_paper1
|storemode=property
|title=Measuring Actual Visitor Engagement in News Websites
|pdfUrl=https://ceur-ws.org/Vol-1618/INRA_paper1.pdf
|volume=Vol-1618
|authors= Razib Iqbal,Matthew Scott,Tarah Cleveland
|dblpUrl=https://dblp.org/rec/conf/um/IqbalSC16
}}
==Measuring Actual Visitor Engagement in News Websites==
Measuring Actual Visitor Engagement in News Websites Razib Iqbal Matthew Scott Tarah Cleveland Missouri State University Missouri State University Valley City State University Springfield, MO, USA Springfield, MO, USA Valley City, ND, USA riqbal@missouristate.edu matthew529@live.missouristate.edu tarah.cleveland@vcsu.edu ABSTRACT online. According to [4], engagement refers to the degree of and depth of visitor interaction on a website against a defined set of As revenues from Internet advertising continue to grow, advertisers goals. Therefore, the obvious question is - How do we measure seek popular news websites for placing advertisements in an effort visitor engagement? In [5], authors collect mouse tracking data to to maximize profits. An important measure of how well a website correlate with visual attention. They examine the cursor behavior is performing or how attractive it is to the advertisers is how to measure user engagement within a controlled environment. engaged the web visitors are with that website. During our However, in reality, web visitors are in an uncontrolled background study, we explored articles covering metrics to environment where they can be away from the computer screen or measure online user activity and engagement. However, none of looking at another application while a specific news website is open those proposed techniques address the need for advertisers and/or in the background. website owners to detect tab/application switching or to find the actual away time from a particular website. To address this need, in Besides academic research, companies in the private sector are this paper, we propose two new metrics, focus ratio and active conducting research for marketers and suggest solutions for ratio, along with a proof of concept tool to track web visitor optimizing web engagement in an effort to maximize advertisement engagement more accurately. To the best of our knowledge, focus revenues. For example, in 2008, Forrester Research Inc. published ratio and active ratio have not been proposed in the literature within the following five tools and technologies to measure engagement the context of web analytics and visitor engagement. [7]: 1. A centralized customer data repository to establish a system of record. 2. Analytics tools to help understand customer behavior. 3. Brand monitoring tools to amplify customer feedback cross CCS Concepts media channels. 4. Ethnographic research to expand the breadth of CCS → Human-centered computing → Human computer customer insight. 5. Measurement dashboards to share results with interaction (HCI) → HCI design and evaluation methods a wide audience. However, measurement of actual visitor engagement lacks Keywords standardization and validated methods for measuring engagement, especially in an online context. The current web engagement Active ratio; focus ratio; online news; visitor engagement; web metrics include a variety of measures, e.g. session duration analytics. (duration index), page views (click index), visits per visitor (recency index), conversion rates, customer satisfaction, brand 1. INTRODUCTION index, interaction index, loyalty index, etc. [6]. Also, web user’s activity can be tracked by means of teleporting, backpaging, In the United States, Internet advertising revenues totaled hyperlinking, etc. These metrics give web advertisers a better nearly $42.8 billion in 2013 which is a 17% increase from the year indication of the sites for best placement of their advertisements 2012 [1]. With advertising being a large part of the revenue of (ads for short). websites, many tools have been made to track their effectiveness. These tools have been able to track many things such as page views, Our preliminary study has revealed that there are some bounce rates, clicks, scrolls and many more metrics. Also, fundamental problems with current measurements for web visitor numerous methods have been proposed in the literature to measure engagement. For example, session duration which tracks visit time user engagement (e.g. [2][3][5]), however, little has been done to on a web page does not take into consideration the physical away assess the user engagement in terms of actual length of the user time of the web visitor from the computer or when a visitor interaction. Therefore, our aim is to be able to track the actual switches to a different tab or application. While advertisers interaction time of the web visitor from page load to when they commonly rely on a visitor’s time spent looking at the website, finally close out of the page. traditional web analytics tools lack capability to measure this accurately. Therefore, in order to address the need for an efficient To better understand new methods for defining how web web engagement tool, in Section 3, we present two new metrics visitor engagement is measured when tabs or applications are focus ratio and active ratio for accurately measuring session switched, we briefly introduce the current definitions of user duration and web visitor engagement. engagement. Authors in [9] described user engagement as the emotional, cognitive and behavioral connection that exists between a user and a resource. Therefore, we can say that higher user 2. ENGAGEMENT METRICS AND TOOLS engagement refers to a website’s ability to hold the attention of a User engagement is measured at large-scale through analytic visitor and/or motivate the visitor to participate in an activity tools assessing users’ depth of interaction with a website, which Copyright is held by the authors. include metrics such as dwell time [9] and clickthrough rate [10]. Authors in [8] explored user comments on web pages to devise a INRA 2016 Workshop, Halifax, Canada. tool for measuring user engagement. Their initiative is focused on human curiosity in online news engagement. In [9], authors counterpart of the focus method. This method is triggered when an adopted a framework for user engagement through researching element loses focus. The blur method is used to remove focus from existing information retrieval metrics, user engagement metrics, an element. Both focus and blur methods are supported by Chrome, web analytics, and measures from immersion in gaming. Other web Internet Explorer, Firefox, Safari, and Opera web browsers. engagement metrics in [4] and [6] include session duration, page Through further investigation, we found that the focus and blur views per session, visits per visitor, loyalty, interaction, and brand methods give a lot of false positives. For example, if a user displays index. a smaller window on top of the browser window, the browser window loses focus, so the blur is raised. However, the user is still Session duration is a measure of the time a visitor spends on a able to see the browser. Relying on user activity also gives a lot of webpage in a given session. By subtracting the last timestamp on a false positive as well. Consider a user watching a video where the given session from the first timestamp on the session, one can user may not move her mouse, click, or produce any key strokes, calculate session duration [4]. Session duration neither takes into but the user is still active on the web page. account the possibility of a web visitor leaving the computer with a webpage still loaded in the browser nor does it indicate if the visitor Finally, the Page Visibility API [12] is a way for website has changed tabs or switched applications. As long as the webpage designers to determine the visibility status of a particular web page. is still loaded on the visitor’s screen, the session duration will It relies on the HTML5 specification. The Page Visibility API increase - whether the visitor is engaged in the site or not. reveals when a webpage is visible or in focus. When a user switches between tabs or minimizes the current tab, the API triggers the Page views can be measured by dividing the number of page visibility change event to notify the developers when the visibility views in a given period of time by the number of visits in a given state changes. period of time [4]. This considers the number of clicks as well as the total amount of information or content the visitor has viewed. Of the metrics and tools discussed so far, it is safe to say that The issue with using the number of page views as a metric alone is a lot of time and research has been plugged into pinpointing a that a visitor may be familiar with the site, and go directly to the definition of web engagement. The many tools and metrics we have page he or she wishes to engage in. Whereas another visitor who is reviewed in our background study suggest a vast knowledge of less familiar with the site may click through many pages before what the engaged visitor looks like compared to that of a less finding the material they wish to engage in. The second type of engaged visitor. However, none of the definitions have addressed visitors would show a higher engagement based on the page views the following questions - How can we determine if a visitor has per session index, however, they may actually be much less switched tabs or applications? And, how long a user has actually engaged in the webpage’s content. been engaged with a particular webpage? Visits per visitor can be described as a measure of the loyalty, frequency, and recency of a visitor to a site over a period of time 3. PROPOSED FOCUS RATIO AND [4]. Visits per visitor is nearly a straight-forward count of the ACTIVE RATIO number of times a visitor has returned to a webpage. Loyalty index is used to describe visitors that have accounts with the website or We define focus ratio as the difference between the time a webpage that visit the website at least three times per week [6]. Both visits has been loaded in a browser and the time that page was actually per visitor and loyalty index metrics use a count of the times a visible in the active tab. For web advertisers, a higher focus ratio visitor has been on a particular webpage. These measures tend to indicates a more attractive location for placing ads on. On the other be less cryptic than previous metrics such as duration (time elapsed) hand, we define active ratio as the difference between the time a and page depth (number of clicks in a page). At the same time, webpage is visible in the active tab and the user is actually recency index and loyalty index fail to differentiate between the considered viewing or interacting with that page. Both focus ratio visitors that are highly engaged but only visit the site once and the and active ratio will help advertisers to determine the worthiness of visitors that visit the site frequently, but are lowly engaged in the placing an ad in a specific webpage at a relatively higher rate. webpage’s content. The benefit of tracking the ratio of active time a user is on the site Brand index measures the level of attention a web visitor is compared with just seeing how long they are active is that we can giving to a site’s brand prior to landing on the site [6]. This is check areas of our websites that might be causing web visitor to measured by examining incoming search phrases and comparing become inactive. If we have pages or news articles on our websites those phrases to brand keywords. If the incoming phrase matches that have a very low active ratio, then we can take a look at these the brand keyword or phrase, then the brand index measurement is pages and see what it is about these pages or news contents that used. might be causing a user to leave the page. This is something we can measure with our proposed active and focus ratios instead of just Interaction index is the engagement that occurs when a visitor simply tracking the average active time of that page. Also, having is directly interacting with the site through commenting, posting in an average active ratio for each webpage on a news website, we forums, and/or engaging in discussion boards [6]. Like the brand will be able to find the contents that are the most appealing to the index, interaction index will score positively if the visitor news readers. These will be the pages or contents that web visitors completes any defined interactions during the session [4]. interact with the most, and thus we can promote these contents or During our investigation phase, we set out to find resources categories to the prospective advertisers. Knowing which of our that would serve as guidelines for implementing a new tool to webpages are the most active will also make the ads more effective accurately detect web visitor engagement. Initially, we found two and better targeted. Therefore, high active and focus ratio pages methods: focus and blur. The focus method is triggered when a will be a major selling points for ad spaces in an online news visitor focuses on an element. This focus is often measured by website. Showing these metrics to the potential ad space buyers or mouse movements or clicks [11]. The blur method is the bidders will help us to be able to charge a fair price for that space. Visitor Engagement events: Active Ratio Visitor Engagement events: Focus Ratio Figure 1. Visitor Engagement events sent to Google Analytics 4. PROOF OF CONCEPT TOOL FOR function to increase the active time. In the clock() function of Riveted, clockTime value is being increased by one every second FOCUS RATIO AND ACTIVE RATIO as long as the user is considered active. If the clockTime is greater In this section, we describe our code snippets written in JavaScript than zero and a multiple of predefined report interval, then an event and PHP to show how various times can be measured and stored to is sent to Google Analytics with the clockTime. clockTime value calculate and report the proposed focus and active ratios. The goal stops increasing when the user has not been active for the value in here is to show a tool that reports these new metrics, focus ratio and idleTimeout. So, if idleTimeout was set to 30, then after 30 seconds active ratio, and then provides an environment (in our case Google of inactivity clock() function would stop being called. Analytics) where we can use these metrics to study the trends. To Page visibility listeners can detect if the page is the active tab in the implement our concept, we used Riveted tool browser. We added two functionalities (hiddentTimeEvent() and (http://riveted.parsnip.io/) for gathering user activity related data totalIdleTime()) to Riveted in order for it to keep track of how long and to calculate the focus ratio and active ratio metrics. We then the page has been hidden. The first function is called when the send this information to Google Analytics listener notices that the page is no longer visible, or becomes (http://www.google.com/analytics/). hidden. This starts a timer that calls the function totalIdleTime() every second and stores it in the variable hiddenTime. 4.1 Tracking active and hidden time totalIdleTime(), when called, increases hiddenTime variable by With the help of the Riveted tool, we add listeners for keystrokes, one. If the user comes back to the page and it is visible again, then mouse click, mouse movement, scroll, page visibility for each of we kill the timer. the web pages in order to detect user activity. When it detects any of these events it resets the idleTimeout value by calling the clock() 4.2 Tracking total time pages including a customizable dashboard to help users to get the Tracking the total time since the page has been loaded is needed for most out of the information they are gathering. One major us to calculate the active ratio of the user. To keep track of total improvement will be to gather focus and active ratios for certain time, we added another two functions (totalTime() and sections of a webpage to accurately determine user engagements in totalTimeEvent()). We call the totalTimeEvent() as soon as Riveted different sections of a webpage. is started, i.e. the webpage has been loaded. The function totalTime() increases visitTime value by one, and then calls the 6. REFERENCES function totalTimeEvent(). The purpose of totalTimeEvent() is to [1] IAB Internet Advertising Revenue Report. Rep. Interactive wait one second before calling totalTime(). This effectively stores Advertising Bureau (2014). Retrieved May 1, 2016 from the total time since the page has been loaded, in seconds, in the http://www.iab.net/media/file/IAB_Internet_Advertising_Re variable visitTime. venue_Report_FY_2013.pdf [2] Lagun, D. & Lalmas, M., (2016), Understanding and 4.3 Sending ratios to Google Analytics Measuring User Engagement and Attention in Online News Finally, we added a functionality to Riveted to calculate the Reading, International Conference On Web Search and Data ratios based on the times we are keeping track of, and then send Mining (WSDM), San Francisco, USA. those ratios to Google Analytics for further study of the webvisitor [3] Barbieri, N., Silvestri, F., & Lalmas, M., (2016). Improving behavior. We achieve this with a function that we call once when Post-Click User Engagement on Native Ads via Survival Riveted is started. The first part of this function adds an event to Analysis. ACM international conference on World Wide Web the page that waits for the page to be closed. We then calculate the (WWW), Switzerland. active ratio, and round it to a whole number, by taking the clockTime and dividing it by the visitTime. This gives us the [4] Peterson, E. T. & Carrabis, J. Measuring the Immeasurable: percentage of time the user was active on that site. Next, we Visitor Engagement. Web Analytics Demystified, (2008). calculate the actual visible time to find the percentage of time the Retrieved May 1, 2016 from page was visible, and not hidden. We subtract the hiddenTime from http://www.webanalyticsdemystified.com/downloads/Web_ visitTime to get the visibleTime. We then calculate focus ratio by Analytics_Demystified_and_NextStage_Global_- taking the visibleTime and dividing it by the visitTime. This gives _Measuring_the_Immeasurable_-_Visitor_Engagement.pdf us the percentage of time the page was visible. The last part of the [5] Arapakis, I., Lalmas, M. & Valkanas, G., (2014). function sends these values to Google Analytics. We send an Understanding Within-Content Engagement through Pattern ‘Event’ with the category of ‘VisitorEngagement’, and label them Analysis of Mouse Gestures, ACM International Conference as ‘Active’ or ‘Focus’ appropriately. on Information and Knowledge Management (CIKM). In Figure 1, we show the screen captures of the sample Google [6] Beckett, L. Getting beyond Just Pageviews: Philly.com’s Analytics dashboard containing our visitor engagement data for the Seven-part Equation for Measuring, (2010). Retrieved May following website: http://www.squidlessgames.com/ over a period 1, 2016 from http://www.niemanlab.org/2010/10/getting- of one week. In Figure 1.A., we show the total visitor engagement beyond-just-pageviews-philly-coms-seven-part-equation-for- events reported versus how many users actually interacted with the measuring-online-engagement website referring to an active ratio of 100%, 50%, and 0%. In [7] Haven, B., & Vittal, S. Five Tools and Technologies to Figure 1.B., we show how many users (out of those who were Measure Engagement, (2008). Cambridge: Forrester interacting with the website) departed right away (0%) or were Research Inc. Retrieved May 1, 2016 from active for the full (100%) and half (50%) of their visit duration to https://www.adobe.com/enterprise/pdfs/five_tools_and_techn the website. These active and focus ratios are also available for the ologies_engagement.pdf individual webpages. For example, in the above website we hosted two games: ‘Floe’ and ‘Vocabulisitics’. Based on the collected user [8] Arapakis, I., et al. (2014). User engagement in online News: engagement data, we found that Floe has higher active and focus Under the scope of sentiment, interest, affect, and gaze. ratio compared to any other pages. Journal of the Association for Information Science and Technology. 65: 1988–2005. doi: 10.1002/asi.23096 5. CONCLUSION [9] Attfield, S., Kazai, G., Lalmas, M., & Piwowarski, B. (2011). In this paper, we presented two new metrics and a proof of concept Towards a science of user engagement (Position Paper), implementation to allow advertisers and/or news website owners to WSDM Workshop on User Modelling for Web Applications, correctly detect when a visitor is physically away or changes Hong Kong. applications in order to calculate the actual level of visitor engagement. We do not use any pre-determined concepts to [10] Chakrabarti, D., Agarwal, D., & Josifovski, V., (2008). characterize the user behavior or recommend news articles. Contextual advertising by combining relevance with click However, focus and active ratio can be used to create an actual feedback, ACM international conference on World Wide Web dwell time profiling of online news readers to help classify the news (WWW), New York, NY, USA, 417-426. articles and rank news categories. This classification and rank [11] Kantor, I. Focus/Blur: methods and events, (2011). (Oracle) information can be used by the news website owners to promote a JavaScript Tutorial. Retrieved May 1, 2016 from specific category or news article to the advertisers in order to earn http://javascript.info/tutorial/focus-blur a higher revenue. Advertisers can also choose a specific ad space, [12] Mozilla Developer Network. Using the Page Visibility API. or place a higher bid on a particular page or news category for their (2015). Retrieved May 1, 2016 from product promotion based on this information. Our tool detects user https://developer.mozilla.org/en- activity by tracking the keyboard and mouse activities as well as US/docs/Web/Guide/User_experience/Using_the_Page_Visi page visibility including tab switching in the browser. In the future, bility_API we plan to add capabilities to compare the ratios among different