<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Pricing risk: Analysis of Irish Car Insurance Premiums</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author role="corresp">
							<persName><forename type="first">Adrian</forename><surname>Byrne</surname></persName>
							<email>adrian.byrne@ucd.ie</email>
							<affiliation key="aff0">
								<orgName type="department" key="dep1">CeADAR</orgName>
								<orgName type="department" key="dep2">NexusUCD</orgName>
								<orgName type="institution">University College Dublin</orgName>
								<address>
									<addrLine>Belfield Office Park, Unit 9</addrLine>
									<settlement>Clonskeagh, Dublin</settlement>
								</address>
							</affiliation>
							<affiliation key="aff1">
								<address>
									<country key="IE">Ireland</country>
								</address>
							</affiliation>
							<affiliation key="aff2">
								<orgName type="department">Idiro Analytics</orgName>
								<address>
									<addrLine>Clarendon House, 39 Clarendon Street</addrLine>
									<settlement>Dublin</settlement>
									<country key="IE">Ireland</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Pricing risk: Analysis of Irish Car Insurance Premiums</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">0619509F05D88B9BB4C2AB18BC8CA881</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T16:31+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Car insurance premiums</term>
					<term>bias detection</term>
					<term>explainable AI</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>With the increasing prevalence of artificial intelligence assisting with decision-making and forthcoming EU legislation attempting to ensure it does no harm to its citizens, this paper evaluates what factors influence the cost of car insurance for individual drivers. The principle is clear: the cost of an insurance premium is determined by the perceived risk of the policyholder. But what specific elements are considered in assessing a driver's risk? To demystify this model, this study developed an automated process to gather quote data based on various factors, such as the driver's gender, age (as a proxy for driving experience), geographical location, occupation, and driving history. We conducted an audit of pricing algorithms employed by insurance companies in the Irish car insurance industry, by gathering quotes through online car insurance websites available in Ireland. This research provides insights into some of the factors influencing car insurance premiums in Ireland, highlighting some of the intricacies behind the complex, algorithmic calculations of car insurance quotations. While acknowledging the complexity of the industry, we find evidence of several potentially problematic issues. We show that place of residence and occupation have a direct and sizeable impact on the prices quoted to drivers.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>Companies are developing increasingly complex machine learning-based models for car insurance pricing <ref type="bibr" target="#b0">[1,</ref><ref type="bibr" target="#b1">2]</ref>. As such, forthcoming EU Artificial Intelligence (AI) Act legislation 2 classifies insurance as 'high-risk' via the access to essential services domain thereby compelling all insurers within the EU who use 'AI' as part of their operation to comply with the forthcoming legislation. Consequently, compliance means undertaking bias detection to mitigate any harmful effects arising from the use of AI (this aspect is covered within section 5 of Article 10 -Data and Data Governance). The motivation for this paper stemmed from trying to understand the potential for bias in AI systems that need to comply with the forthcoming EU AI Act legislation. Moreover, insurers do not normally provide customers with much information on why they have been offered a certain price, and they do not publish data showing the prices offered to different groups. This study aimed to shine a light on these inconspicuous aspects of car insurance premiums.</p><p>At the intersection of these considerations, this study delves into the pricing practices of ten insurance companies in Ireland. By scrutinising the factors and their respective magnitudes that contribute to car insurance quotes, this research aimed to unravel the complexities of the business model. Moreover, this study sought to contribute to a comprehensive understanding of the dynamics of car insurance premiums and foster greater transparency within the industry. We hope that this endeavour will encourage insurers to consider undertaking and publishing independent algorithmic bias audits on their website. Similar to the requirement stipulated by the New York City Local Law 144 of 2021 on employment decisions assisted by AI 3 , this practice will go towards enhancing transparency and trust.</p><p>To the best of our knowledge, no study to date has analysed the contextual influence of certain features on car insurance pricing in Ireland. Additionally, we are unaware of any study that has examined the influence of socio-demographic features on pricing.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Research questions.</head><p>In this paper, we exploited nearly 40,000 quotes gathered via an automated retrieval process.</p><p>RQ1: What are the factors that play a major role in setting Irish car insurance premiums? RQ2: Do gender and ethnicity inferred by name directly influence quoted premiums? RQ3: Are riskier driver profiles uniformly discriminated against regardless of location and occupation?</p><p>Using an experimental design, we gathered data by varying some features of the 'applicant' while keeping the rest of the features constant. This allowed us to conduct an algorithmic audit to uncover any connections between inputs and outputs. This study builds upon the work of <ref type="bibr" target="#b2">Fabris et al. (2021)</ref>  <ref type="bibr" target="#b2">[3]</ref> and Cook et al. (2022) <ref type="bibr" target="#b3">[4]</ref> whose efforts inspired this research.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Background and related work</head><p>Since 2012, the use of gender in the Irish car insurance industry has been regulated. The European Union has adopted legislation which prohibits the direct use of gender for setting insurance premiums <ref type="bibr" target="#b4">[5,</ref><ref type="bibr" target="#b5">6]</ref>. The principle of gender equality is enshrined by Articles 21 and 23 of the Charter of Fundamental Rights of the European Union <ref type="bibr" target="#b6">[7,</ref><ref type="bibr" target="#b7">8]</ref>. Gender equality has been explicitly operationalised in the context of insurance, with Article 5(1) of Council Directive 2004/113/EC <ref type="bibr" target="#b8">[9]</ref>, stating that no difference in individuals' premiums can result from the use of gender as an explicit factor, and fully confirmed in a 2011 judgement by the European Court of Justice <ref type="bibr" target="#b5">[6]</ref>. Official guidelines on the application of the ruling <ref type="bibr" target="#b4">[5]</ref> explicitly mention motor insurance, clarifying that indirect discrimination remains possible where justifiable: "For example, price differentiation based on the size of a car engine in the field of motor insurance should remain possible, even if statistically men drive cars with more powerful engines". Moreover, information about gender may still be collected, stored, and used, e.g. to monitor portfolio mix or for the purposes of reinsurance.</p><p>In terms of geographic location, which is a publicly acknowledged but often underestimated factor, it emerges as a substantial determinant of car insurance premiums. An Irish insurance broker, Chill, has released research on the price differences in car insurance based on the driver's location within Ireland <ref type="bibr">[10]</ref>. The study by the car insurance broker found that Longford had the highest quotes among all locations (average €738). While the rationale behind this practice often hinges on the increased risks associated with higher-crime areas (for example), questions arise about the equity of imposing a 'penalty' on drivers based solely on their residential neighbourhood. Despite this focus on creating an individual price for each customer, there is evidence that some groups are likely to pay more for insurance than others. Previous research has indicated that people of colour could be experiencing worse outcomes -including higher prices -in insurance markets compared to white consumers. This paper follows a range of investigations looking at unequal outcomes in the insurance market. In November 2015, the Consumer Federation of America found that price of car insurance offered to drivers increased where the proportion of African Americans living in a community increased <ref type="bibr">[11]</ref>. In July 2016, research by Webber Phillips found a postcode-based ethnicity penalty for motor insurance customers in the UK, affecting 12 million people <ref type="bibr">[12]</ref>.</p><p>To the best of our knowledge, no such investigation has been conducted for the Irish car insurance market. Our aim was therefore to close a transparency gap between insurers and customers in Ireland and help develop best practice when it comes to bias detection which forms part of the forthcoming EU AI Act.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Data and methods 4</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Design of "mystery shopper" experiment</head><p>Firstly, the Competition and Markets Authority (CMA) has suggested mystery shopping as an appropriate technique for investigating potential harm caused by algorithmic bias <ref type="bibr">[13]</ref>. Secondly, to conduct our research, we crafted individual scripts tailored to each insurer used in this study. The scripts were written using the set of questions provided on each insurer's website. A total of ten insurance companies were included in this study. This approach facilitated the automated retrieval of insurance quotes by referencing a spreadsheet containing the profiles of individual drivers and their corresponding responses. Driver names were sourced from online databases categorising names by gender and ethnicity. For standardisation, we set driver ages at 25, 40, and 60, linking age to years of driving experience and policy in their own name. We established the relationship between the driver's age and driving experience: age -18 = years of driving experience, and age -19 = years of policy in their own name. Consequently, age was a perfect proxy for driving experience and years of policy in own name in our study. We also strategically selected 20 locations (i.e. 'locales') to capture diverse contrasts, encompassing urban versus rural settings, variations in house prices, levels of deprivation, ethnic diversity, and per capita statistics for crime, road traffic accidents, and penalty points. These 20 locations were drawn from the following six counties: Dublin, Wicklow, Cork, Longford, Roscommon, and Donegal.</p><p>We employed multivariate log-linear regression models to derive mean predicted quotes, transforming the quotes on a log base 10 scale to address positive skewness. The predictor variable sets varied depending on the like-with-like comparison under consideration, but the model predicted mean quotes all accounted for age, gender, locale, insurer, and time of quote (hour/day/month). To illustrate feature importance within each model setup, we used SHAP (SHapley Additive exPlanations) plots after running random forest regression models <ref type="bibr">[14]</ref>. We also complemented this approach with AVTS (Absolute Value Test Statistic) numbers derived from the log-linear regression models <ref type="bibr" target="#b9">[15]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Results 5</head><p>We contextualised the quotes retrieved according to characteristics attributable to either the locale or county level and we present these results in Figure <ref type="figure" target="#fig_0">1</ref> below. These characteristics include ethnic makeup, house prices, deprivation score, crime, road traffic accidents, and penalty points. In Figure <ref type="figure" target="#fig_0">1</ref>, the top four plots displaying proportion of white Irish, mean house prices, level of deprivation (10 = highest), and criminal offences per capita appear to have a bigger impact on quote amount than the bottom four plots displaying fatal/serious/minor road traffic accidents and penalty points per capita.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Discussion and conclusion</head><p>Our research revealed several noteworthy findings. We found no disparity between male and female drivers' insurance quotes, which shows that the EU legislation banning gender discrimination in car insurance premiums is working well in Ireland. Next to NCB/days since claim and age, which was a proxy for driving experience, locale was the third most important variable affecting car insurance premiums. The locations with higher crime rates, more deprivation, more ethnic diversity, and more fatal road accidents directly correlated with higher car insurance premiums. Additionally, our investigation revealed that drivers from higher-premium locations were disproportionately penalised for having zero NCB, making claims, or receiving penalty points compared to their counterparts in lower-premium areas.</p><p>Combining the contextual data with our study results gives us a grasp of how insurance pricing works. It is essential to consider the contextual factors associated with specific locations and the socioeconomic dynamics of communities when seeking to comprehend the determinants influencing car insurance costs. This approach significantly contributes to a comprehensive understanding of the overarching implications for individuals with insurance in varied geographic settings. Furthermore, occupation, specifically retail, significantly influenced car insurance premiums. This impact also varied across locations, highlighting geographical disparities.</p><p>The interplay of location, demographics, and occupation adds depth to the industry's complexities. These insights extend beyond immediate premium considerations, offering a foundation for future research and policy considerations. As innovative technologies, including AI, continue to advance, it is inevitable that car insurance companies will integrate these tools into their processes. Therefore, a comprehensive understanding of the sophisticated dynamics involved in determining car insurance premiums is paramount for stakeholders and policymakers. It is imperative to conduct a thorough assessment of the decision-making mechanisms employed by car insurance companies' models to mitigate the risk of potential biases and perceived unfairness.</p><p>Limitations and future work. Despite retrieving nearly 40,000 quotes, our final dataset cannot be considered fully representative of the Irish driving population at large. Moreover, we were only able to examine a subset of the relevant features, which does not fully characterise the performance of the pricing algorithm. <ref type="bibr" target="#b2">Fabris et al. (2021)</ref> ran into a similar limitation when studying the Italian car insurance market <ref type="bibr" target="#b2">[3]</ref>. Also, like Cook et al. (2022) <ref type="foot" target="#foot_2">6</ref> , our exploratory research allowed us to test the outcomes of pricing mechanisms, but we cannot explain why the outcomes we identified occurred <ref type="bibr" target="#b3">[4]</ref>. While our experiments showed possible discrimination regarding differential quotes, we did not attempt to quantify the impact of this discrimination for all of Ireland. We leave this large and complex task as an interesting endeavour for future work.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: Effect of county and locale contextual data on quote amount</figDesc><graphic coords="3,83.83,496.47,455.33,136.00" type="vector_box" /></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_0">Section reduced due to space constraints. Full list of assumptions will be presented at EWAF'24.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="5" xml:id="foot_1">It is not possible to present all our results in this extended abstract due to space constraints. Full results will be presented at EWAF'24.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="6" xml:id="foot_2">Our study established similar results in terms of no significant difference in premiums charged to people with different names in the same location, but higher average quotes in areas where non-white ethnicities make up a large proportion of the population.</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.">Acknowledgements</head><p>This research, jointly produced by CeADAR, Ireland's Centre for Artificial Intelligence, and Idiro Analytics, has received funding from the European Union's Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement No. 847402. The content of this paper reflects only the authors' views. Neither Enterprise Ireland nor the European Commission are responsible for any use that may be made of the information disclosed.</p></div>
			</div>


			<div type="funding">
<div xmlns="http://www.tei-c.org/ns/1.0"> <ref type="bibr" target="#b1">2</ref> <p>See https://eur-lex.europa.eu/legal-content/EN/TXT/?uri=CELEX:52021PC0206 3 See https://rules.cityofnewyork.us/wp-content/uploads/2023/04/DCWP-NOA-for-Use-of-Automated-Employment-Decisionmaking-Tools-2.pdf</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Estimating Car Insurance Premia: A Case Study in High-Dimensional Data Inference</title>
		<author>
			<persName><forename type="first">Nicolas</forename><surname>Chapados</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Yoshua</forename><surname>Bengio</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Pascal</forename><surname>Vincent</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Joumana</forename><surname>Ghosn</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Charles</forename><surname>Dugas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ichiro</forename><surname>Takeuchi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Linyan</forename><surname>Meng</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 14th International Conference on Neural Information Processing Systems: Natural and Synthetic</title>
				<meeting>the 14th International Conference on Neural Information Processing Systems: Natural and Synthetic<address><addrLine>Vancouver, British Columbia, Canada; Cambridge, MA, USA</addrLine></address></meeting>
		<imprint>
			<publisher>MIT Press</publisher>
			<date type="published" when="2001">2001</date>
			<biblScope unit="page" from="1369" to="1376" />
		</imprint>
	</monogr>
	<note>NIPS&apos;01</note>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Research on the UBI Car Insurance Rate Determination Model Based on the CNN-HVSVM Algorithm</title>
		<author>
			<persName><forename type="first">Chun</forename><surname>Yan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Xindong</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Xinhong</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Liu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Access</title>
		<imprint>
			<biblScope unit="volume">8</biblScope>
			<biblScope unit="page" from="160762" to="160773" />
			<date type="published" when="2020">2020. 2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Algorithmic Audit of Italian Car Insurance: Evidence of Unfairness in Access and Pricing</title>
		<author>
			<persName><forename type="first">Alessandro</forename><surname>Fabris</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Alan</forename><surname>Mishler</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Stefano</forename><surname>Gottardi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Mattia</forename><surname>Carletti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Matteo</forename><surname>Daicampi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Gian</forename><forename type="middle">Antonio</forename><surname>Susto</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Gianmaria</forename><surname>Silvello</surname></persName>
		</author>
		<idno type="DOI">10.1145/3461702.3462569</idno>
		<ptr target="https://doi.org/10.1145/3461702.3462569" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2021 AAAI/ACM Conference on AI, Ethics, and Society (AIES &apos;21)</title>
				<meeting>the 2021 AAAI/ACM Conference on AI, Ethics, and Society (AIES &apos;21)<address><addrLine>Virtual Event, USA; New York, NY, USA</addrLine></address></meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2013">2021. May 19-21, 2021. 13</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Discriminatory pricing: Exploring the &apos;ethnicity penalty&apos; in the insurance market</title>
		<author>
			<persName><forename type="first">Tilly</forename><surname>Cook</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Aiden</forename><surname>Greenall</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Emer</forename><surname>Sheehy</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Citizens Advice</title>
		<imprint>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Guidelines on the application of Council Directive 2004/113</title>
		<ptr target="https://eur-lex.europa.eu/legal-content/EN/TXT/?uri=CELEX:52012XC0113" />
	</analytic>
	<monogr>
		<title level="m">European Union in Case C-236/09 (Test-Achats) C-11</title>
				<imprint>
			<publisher>EC -European Commission</publisher>
			<date type="published" when="2012-01">2012. 01</date>
		</imprint>
	</monogr>
	<note>EC to insurance, in the light of the judgment of the Court of Justice of the</note>
</biblStruct>

<biblStruct xml:id="b5">
	<monogr>
		<ptr target="https://eur-lex.europa.eu/legal-content/EN/ALL/?uri=CELEX:62009CJ0236" />
		<title level="m">Association belge des Consommateurs Test-Achats ASBL v Conseil des ministres</title>
				<imprint>
			<date type="published" when="2011">2011. 2011</date>
		</imprint>
	</monogr>
	<note>C-236/09</note>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<ptr target="https://eur-lex.europa.eu/legal-content/EN/TXT/?uri=CELEX%3A32000X1218%2801%29" />
		<title level="m">Charter of Fundamental Rights of the European Union C-364/01</title>
				<imprint>
			<publisher>EU -European Union</publisher>
			<date type="published" when="2000">2000</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<monogr>
		<ptr target="https://eur-lex.europa.eu/legal-content/EN/TXT/?uri=CELEX:12012P/TXT" />
		<title level="m">Charter of Fundamental Rights of the European Union C-326/391</title>
				<imprint>
			<publisher>EU -European Union</publisher>
			<date type="published" when="2012">2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<monogr>
		<ptr target="https://eur-lex.europa.eu/legal-content/EN/TXT/?uri=celex:32004L0113" />
		<title level="m">Implementing the principle of equal treatment between men and women in the access to and supply of goods and services L-373-37</title>
				<imprint>
			<date type="published" when="2004">2004</date>
		</imprint>
	</monogr>
	<note>Council of the EU</note>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">How to obtain the P value from a confidence interval</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">G</forename><surname>Altman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">M</forename><surname>Bland</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Bmj</title>
		<imprint>
			<biblScope unit="volume">343</biblScope>
			<date type="published" when="2011">2011</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
