Inductis
Who We Are

Who we are

Inductis White Paper
Emerging Trends in Auto Insurance Pricing
Frank Cacchione, Arnab Dey & Ritesh Aggarwal
 

ABSTRACT
Competition in the insurance market today is intense and orthodox pricing mechanisms are already failing. Any risk model overestimates the level of risk of some groups of policies and underestimates the risks of others. Overcharging low-risk policyholders drives them away to seek lower rates from competitors, thereby reducing revenue and market share. Undercharging high-risk policyholders attracts similar high-risk customers, thereby driving up claims and losses. To remain competitive, insurers must constantly improve the assessments of their true levels of risk and price policyholders accordingly.

In the U.S. auto insurance industry, many insurers have been slow to improve their modeling and pricing approaches with the new analytic tools they have at their disposal. Traditionally, the industry relied on loss ratio models, used limited external data, had a compartmentalized model structure and used variables that were difficult to obtain accurately.

Led by a few insurers, the industry is headed towards pure premium modeling at individual coverage level using extensive external data and variables that can be verified. These companies are also improving their data collection process so that they have access to richer information to make better pricing decisions. They are using advanced software tools such as SAS, CART and MARS, and new modeling techniques like Poisson and Gamma regression for accurate analysis. They are also focusing on more accurately allocating non-loss related expenses such as acquisition expenses into the pricing structure as opposed to traditional fixed load methods.

While the industry continues to face challenges such as regulatory restrictions on use of certain variables, difference across states and data integration across the organization, companies that embrace these new techniques while managing the market challenges will emerge winners in the next few years at the expense of those who continue to rely on outdated methods that can lead to further erosion of market share and adverse risk selection.

With increased data collection ability, improved database management techniques, and advanced statistical tools, companies today can mine large quantities of data and perform complex statistical analysis to drive optimal pricing decisions.
There are five key levers companies need to master to effectively leverage the power of data and analysis in order to drive profitable growth through improved pricing:

  1. Data Preparation
  2. Modeling Approach
  3. Model Structure
  4. Use of External Data
  5. Allocation of Other Expenses

(A) DATA PREPARATION
For insurance pricing, large datasets need to be analyzed.
A large auto insurer could have gigabytes of data on millions of policies spanning over 8-10 years. Therefore, optimal data preparation and processing techniques should be followed to avoid long processing times.
There are five steps to effective data preparation:

  1. Data Cleansing: This step involves understanding the variables and their business meaning which in turn is used to treat variable outlier values and missing values of variables.
  2. Data Roll-up: Most companies have transactional level data, with multiple records for a policy if it had endorsements. Some companies may also have multiple records for a policy due to other reasons, such as calendar year reporting. In such cases, rolling up the data to enable policy-level analysis becomes important. At the same time, presence of fewer records for each policy increases the predictiveness of models. Data roll-up involves determining a methodology which appropriately treats policy changes (e.g. addition of a driver/vehicle, cancellation, change in coverage, etc.) and combines multiple transactional level records into one record per policyvehicle- term.
  3. Data Profiling: Often a-priori knowledge of the nature of relationship between the variables is not available or easily derivable. In such cases, detailed exploratory analysis is required to identify the relationships among events and values to generate business hypotheses before diving into modeling.
  4. Variable Selection: Transaction datasets have a large number of variables, with quite a few having very little relevance to predictive modeling. A useful set of variables is selected to ensure meaningful modeling and manageable dataset size. Variables are selected through techniques such as clustering, correlations, etc.
  5. Data Exploration: Various techniques such as univariate analysis, multivariate analysis and advanced visualization are used to understand the relationships between variables in detail. These also help in introducing derived variables that may enhance the predictiveness of the model. A layout of these techniques is shown in Figure 1, and an illustrative CART tree is shown in Figure 2.

(B) MODELING APPROACH
The traditional method for constructing risk models involved segmenting the overall population of policy holders into a collection of risk groups based on a set of factors, such as age, gender, driving distance to place of employment, etc. This assumed that the resulting segments are homogeneous in terms of risk. The introduction of credit scores was a significant first step in moving beyond class plan variables in developing different pricing tiers but mostly as overlays of existing class plans.

Over time with the development of new tools and data mining techniques, the focus shifted to loss ratio modeling. This method appears simple but is often misleading. A ratio greater than 1.00 indicates that more losses have been paid than premiums that would have been earned at current rate level, suggesting that an increase in the premium rate is needed. The converse also applies.

This method is most appropriate when the units exposed
to loss are homogenous and do not change substantially over time. But given the nature of the industry and competitive offers available to customers, the assumption of having uniform unit exposure is often violated.

The most advanced companies are now moving beyond this method towards pure premium (claim frequency X claim severity) modeling which by definition is free from this fundamental assumption fallacy1.

At the same time, complex statistical techniques like Poisson and Gamma regression are now being used. In general, number of claims in a given time period should satisfy the following:

  • The number of claims occurring in any two disjoint time intervals is independent
  • No more than one claim may arise from the same event.
  • The probability that a claim occurs at a given
    fixed time point is equal to zero
Under the above assumptions it can be easily proved that number of claims in a given time period is Poisson distributed2.

     Page 1 2
Quick Links for Financial and Insurance Consulting Services and More...
Apply For Insurance Consulting Services-Inductis

APPLY TO INDUCTIS

Inductis - Focusing On Professional Financial Consulting & Insurance Services
FOCUS AREAS
Case Study of Best Financial Consulting Services & Insurance Consulting-Inductis and More...
CASE STUDIES
  Select examples of how Inductis teams have achieved results for a variety of clients ...more >>
Best Financial Consulting Company- Inductis
PUBLICATIONS
  Our thoughts on how organizations can elevate their performance ...more >>
Site Map -Inductis
SITE MAP
Contact Us for Financial Services and Insurance Consulting Services - Inductis
CONTACT US
Copyright © 2002 - 2008 Inductis Inc.