翻译下面一段文字到英文，谢谢随着信息时代的到来，大量数据的处理成为了人们研究的重点，寻找其规律并对其进行运用。对大量数据

翻译下面一段文字到英文，谢谢
随着信息时代的到来，大量数据的处理成为了人们研究的重点，寻找其规律并对其进行运用。对大量数据进行分类是人们常做的事，分类问题是数据挖掘领域中重要的研究内容之一。传统的分类方法对平衡数据集分类取得了良好的效果，但实际的数据集往往不平衡：即数据集中某类的样本数远远大于其他类的样本数目。对于基于总体分类精度为学习目标的传统分类器而言，这种不均衡势必会导致分类器过多关注多数类样本，从而使少数类样本分类性能下降。而在实际应用中，人们更关心的恰恰是数据集中的少数类，并且错分这些少数类的代价也通常大于多数类。
对不平衡数据可以从算法层面和数据层面来处理。本文的出发点是从算法层面来对数据进行分类，研究如何使用支持向量机方法对不平衡数据进行分类，主要工作有一下几点：
一：对支持向量机理论进行研究。
二：对不平衡数据分类进行研究。
三：给出一中基于聚类的数据集划分支持向量机方法DISVM。

yuanzi5811 1年前已收到1个回答举报

yinbing1030 幼苗

共回答了23个问题采纳率：87% 举报

With the advent of the information age, a lot of data processing has become a focus of research, to find its law and its application. Large amounts of data classification is that people often do, classification data mining is one of the important areas of research. The traditional classification of equilibrium data set classification achieved good results, but the actual data sets are often unbalanced: the sample dataset certain number is far greater than other classes of the number of samples. For the overall classification accuracy based on learning objectives of the traditional classification terms, this imbalance will inevitably lead to too much attention to the majority class classifier sample, so that the minority sample classification performance. In practice, people are more concerned with just a few classes in the dataset, and misclassification costs of those few classes also usually larger than most classes.
Imbalance data can be processed from algorithms and data planes. The starting point of this article is from the algorithm level to classify the data, how to use support vector machine for unbalanced data classification, the main work has the following points:
A: support vector machine theory research.
II: unbalanced data classification research.
Three: given a set of data based on clustering support vector machine division DISVM.

1年前

可能相似的问题