A Parallel Solver for Generalised Additive Models

Date

1999

Authors

Hegland, Markus
McIntosh, I
Turlach, B

Journal Title

Journal ISSN

Volume Title

Publisher

Elsevier

Abstract

An implementation of the backfitting algorithm for generalised additive models which is suitable for parallel computing is described. This implementation is designed to handle large data sets such as those occurring in data mining with several millions of observations on several hundreds of variables. For such large data sets it is crucial to have a fast, parallel implementation for fitting generalised additive models to allow an exploratory analysis of the data within a reasonable time. The approach used divides the data into several blocks (groups) and fits a (generalised) additive model to each block. These models are then merged to a single, final model. It is shown that this approach is very efficient as it allows the algorithm to adapt to the structure of the parallel computer (number of processors and amount of internal memory).

Description

Keywords

Keywords: Computer systems programming; Data mining; Data reduction; Data structures; Parallel algorithms; Backfitting algorithms; Generalized additive models; Parallel processing systems Additive models; Backfitting; Data mining; Local scoring; Parallel algorithms

Citation

Source

Computational Statistics and Data Analysis

Type

Journal article

Book Title

Entity type

Access Statement

License Rights

DOI

Restricted until

2037-12-31