Penalized Optimal Scaling for Ordinal Variables with an Application to International Classification of Functioning Core Sets

Ordinal data occur frequently in the social sciences. When principal component analysis (PCA) is applied, however, those data are often treated as numeric, which implies linear relationships between the variables at hand, or non-linear PCA is applied, where the obtained quantifications are sometimes hard to interpret. Non-linear PCA for categorical data, also called optimal scoring/scaling, constructs new variables by assigning numerical values to categories such that the proportion of variance in those new variables that is explained by a predefined number of principal components is maximized. We propose a penalized version of non-linear PCA for ordinal variables that is a smoothed intermediate between standard PCA on category labels and non-linear PCA as used so far. The new approach is by no means limited to monotonic effects and offers both better interpretability of the non-linear transformation of the category labels and better performance on validation data than unpenalized non-linear PCA and/or standard linear PCA. In particular, an application of penalized optimal scaling to ordinal data as given with the International Classification of Functioning, Disability and Health (ICF) is provided.
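To make the criterion in the abstract concrete, the following minimal Python sketch evaluates the proportion of variance explained by the first principal component for two different quantifications of the same ordinal data. It only evaluates the criterion; it does not perform the optimization over quantifications or apply the penalty proposed in the paper. The simulated data, the function names `quantify` and `explained_variance_ratio`, and the alternative scores `[1, 2, 3, 6]` are all hypothetical and chosen purely for illustration.

```python
import numpy as np

def quantify(labels, scores):
    """Replace integer category labels 1..K in each column by that column's scores."""
    cols = [np.asarray(scores[j])[labels[:, j] - 1] for j in range(labels.shape[1])]
    return np.column_stack(cols)

def explained_variance_ratio(X, n_components):
    """Share of total variance of the standardized variables captured by the leading components."""
    eigvals = np.linalg.eigvalsh(np.corrcoef(X, rowvar=False))[::-1]  # descending order
    return eigvals[:n_components].sum() / eigvals.sum()

# Hypothetical ordinal data: three noisy copies of one latent trait, cut into 4 categories.
rng = np.random.default_rng(0)
latent = rng.normal(size=(200, 1))
noisy = latent + 0.5 * rng.normal(size=(200, 3))
labels = np.digitize(noisy, bins=[-1.0, 0.0, 1.0]) + 1  # categories 1..4

# Standard PCA on category labels corresponds to linear scores 1, 2, 3, 4.
linear_scores = [[1, 2, 3, 4]] * 3
# A hypothetical non-linear quantification that stretches the top category.
alt_scores = [[1, 2, 3, 6]] * 3

ratio_linear = explained_variance_ratio(quantify(labels, linear_scores), 1)
ratio_alt = explained_variance_ratio(quantify(labels, alt_scores), 1)
print(f"linear scores: {ratio_linear:.3f}, alternative scores: {ratio_alt:.3f}")
```

Non-linear PCA would search over the scores to maximize this ratio, while the penalized version described above would, presumably, trade that maximization off against the smoothness of the scores across adjacent categories.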

PCA

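In statistics, principal component analysis (PCA) is a method that projects data from a higher-dimensional space into a lower-dimensional space by maximizing the variance along each dimension. Given a set of points in two, three, or more dimensions, a "best-fitting" line can be defined as the line that minimizes the average squared distance from the points to the line. The next best-fitting line can be chosen in the same way from the directions perpendicular to the first. Repeating this process yields an orthogonal basis in which the individual dimensions of the data are uncorrelated. These basis vectors are called principal components.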
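The construction described above can be sketched in a few lines of Python using the singular value decomposition of the centered data. This is a minimal illustration under assumed conditions: the function name `principal_components` and the simulated data are hypothetical, and SVD is only one of several equivalent ways to compute the components.

```python
import numpy as np

def principal_components(X):
    """Return principal axes (rows), projected scores, and the variance along each axis."""
    Xc = X - X.mean(axis=0)                       # center each variable
    _, s, Vt = np.linalg.svd(Xc, full_matrices=False)
    scores = Xc @ Vt.T                            # coordinates in the new orthogonal basis
    variances = s**2 / (len(X) - 1)               # sample variance along each axis
    return Vt, scores, variances

# Hypothetical correlated data in three dimensions.
rng = np.random.default_rng(1)
mixing = np.array([[2.0, 0.5, 0.0],
                   [0.0, 1.0, 0.3],
                   [0.0, 0.0, 0.2]])
X = rng.normal(size=(100, 3)) @ mixing

components, scores, variances = principal_components(X)
print(variances / variances.sum())  # share of variance per component, largest first
```

The rows of `components` form an orthonormal basis, and the covariance matrix of `scores` is diagonal, which is the sense in which the individual dimensions are uncorrelated.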
