Classification modeling on distributed environment

TA Choo, AA Bakar, AB Talebi, E Sundararajan, M Rahmany
2013 IEEE Conference on Open Systems (ICOS), 2013 - ieeexplore.ieee.org
High Performance Computing (HPC) is usually used in science and technology to solve problems that cannot be solved on a single machine because of constraints on computing resources such as memory and the number of processors. HPC can improve processing speed. However, using a high-powered supercomputer for this type of problem is very costly. In some circumstances, High-Throughput Computing (HTC) on distributed environments performs parallel processing at speeds comparable to those of a supercomputer. In this work, we reduce the processing time of the mining process used to build classification models for a large data file on distributed environments, via a web-based portal that provides various classification methods. The web-based application was built in PHP and combines classification techniques from the data mining software WEKA version 3.6.0 with a percentage split of training and testing data. HTCondor middleware is used to control and run all jobs on the distributed environment. The results show a significant improvement in processing time.
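The abstract does not show how the portal hands jobs to HTCondor, so the following is only a minimal sketch of one plausible arrangement: a Python helper that writes an HTCondor submit description with one job per WEKA classifier, each evaluated with a percentage split. The Java path, jar name, dataset name, file names, and the function `build_submit_description` are all hypothetical, and the paper's portal was written in PHP rather than Python.

```python
from typing import List

# Hypothetical defaults: none of these paths or names come from the paper.
JAVA_PATH = "/usr/bin/java"
WEKA_JAR = "weka.jar"


def build_submit_description(classifiers: List[str], dataset: str,
                             split_percent: int = 66) -> str:
    """Return an HTCondor submit description with one job per WEKA classifier."""
    if not 0 < split_percent < 100:
        raise ValueError("split_percent must be strictly between 0 and 100")

    # Settings shared by every job in the cluster.
    lines = [
        "universe = vanilla",
        f"executable = {JAVA_PATH}",
        "transfer_executable = False",  # assume Java is installed on each node
        "should_transfer_files = YES",
        "when_to_transfer_output = ON_EXIT",
        f"transfer_input_files = {WEKA_JAR}, {dataset}",
        "log = classify.log",
        "",
    ]

    # One queued job per classifier, each with its own output files.
    for name in classifiers:
        tag = name.rsplit(".", 1)[-1]  # e.g. "J48" from the full class name
        lines += [
            f"arguments = -cp {WEKA_JAR} {name} -t {dataset} "
            f"-split-percentage {split_percent}",
            f"output = {tag}.out",
            f"error = {tag}.err",
            "queue",
            "",
        ]
    return "\n".join(lines)


if __name__ == "__main__":
    print(build_submit_description(
        ["weka.classifiers.trees.J48", "weka.classifiers.bayes.NaiveBayes"],
        "data.arff",
    ))
```

Saving the returned text to a file and passing it to `condor_submit` would queue the jobs. Because each classifier runs as an independent job, the pool can schedule them on different machines, which is the kind of throughput-oriented parallelism the abstract describes.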