Optimized sampling with clustering approach for large intrusion detection data
Data mining is a process of discovering useful information from a data set. In data mining, there is a classification technique that depends on sampling accuracy to acquire a more accurate result in data classification or prediction. Therefore, a necessity in getting a good-quality sampling is required. The primary purpose of this research paper is to obtain the optimum sampling representing the original data set. Through sampling, we could minimize the total data that need to be processed. Because large amount of data requires longer processing time, reducing the amount of data with sampling will speed up the process of computing. In this study we introduced a new sampling algorithm with clustering approach applied to a network security data set. The final results showed that proposed method offer fine result for large data set sampling.
B00533 | (wh) | Available |
No other version available