There is more statistical data produced in today’s modern society than ever before. This data is analysed and cross-referenced for innumerable reasons. However, many data sets have no shared element and are harder to combine and therefore obtain any meaningful inference from. Statistical matching allows just that; it is the art of combining information from different sources (particularly sample surveys) that contain no common unit. In response to modern influxes of data, it is an area of rapidly growing interest and complexity. Statistical Matching: Theory and Practice introduces the basics of statistical matching, before going on to offer a detailed, up-to-date overview of the methods us...
The theme of the meeting was “Statistical Methods for the Analysis of Large Data-Sets”. In recent years there has been increasing interest in this subject; in fact a huge quantity of information is often available but standard statistical techniques are usually not well suited to managing this kind of data. The conference serves as an important meeting point for European researchers working on this topic and a number of European statistical societies participated in the organization of the event. The book includes 45 papers from a selection of the 156 papers accepted for presentation and discussed at the conference on “Advanced Statistical Methods for the Analysis of Large Data-sets.”
Data editing methods and techniques may significantly influence the quality of statistical data as well as the cost efficiency of statistical production. Volume 2 is the logical continuation of the first part of the series, which defined statistical data editing and presented associated methods and software. The aim of these publications is to assist National Statistical Offices in their efforts to improve and economize their data editing processes.
Data analysis and machine learning are research areas at the intersection of computer science, artificial intelligence, mathematics and statistics. They cover general methods and techniques that can be applied to a vast set of applications such as web and text mining, marketing, medical science, bioinformatics and business intelligence. This volume contains the revised versions of selected papers in the field of data analysis, machine learning and applications presented during the 31st Annual Conference of the German Classification Society (Gesellschaft für Klassifikation - GfKl). The conference was held at the Albert-Ludwigs-University in Freiburg, Germany, in March 2007.