SPLICE

This article introduces the SPLICE R package for data simulation, a successor to SynthETIC.
R
data
simulation
Authors

Greg Taylor

Sarah MacDonnell

Published

November 14, 2022

The R data simulation package SynthETIC has been overtaken by an updated version, SPLICE (Synthetic Paid Loss and Incurred Cost Experience).

SPLICE, whilst still based on SynthETIC, has now been extended to simulate case estimates, and hence incurred claims. It can be accessed on CRAN, along with other relevant resources including a reference manual.

SPLICE is a useful tool for producing simulated datasets for testing out various reserving, including machine learning, methods. It generates datasets of triangles, as well as individual claims transactions, showing paid and incurred developments by occurrence as well as notification and settlement times.

The user can set up specific features in the datasets. For example in the Al-Mudafer thesis which we have previously featured, 4 different types of claims triangles were generated:

The MLR WP used these same datasets in our GIRO 21 workshop Machine Learning Reserving on Triangle Data.

We recommend using SPLICE over SynthETIC going forward as we understand that SPLICE is the one that will continue to be supported and updated.

Our previous article introducing SynthETIC can be found here.

About the authors


Copyright © Machine Learning in Reserving Working Party 2024