Hardware Implementation and Performance Evaluation of K-Means and K-Means++ Clustering Algorithms

dc.contributor.advisorJohn, Eugene
dc.contributor.authorSingh, Manisha
dc.contributor.committeeMemberBanerjee, Taposh
dc.contributor.committeeMemberKrishnan, Ram
dc.descriptionThis item is available only to currently enrolled UTSA students, faculty or staff. To download, navigate to Log In in the top right-hand corner of this screen, then select Log in with my UTSA ID.
dc.description.abstractThe tremendous increase in complex unbounded dynamic data in the fields of geology, health, image processing, climatology, network security, etc. requires high speed and performance in data processing for analyzing big data. Current big data analysis algorithms are software implementation based and are not sufficient for processing and analyzing the data efficiently. Big data analysis algorithm involves data mining which partitions and clusters similar data into meaningful groups. The objective of this study is to provide big data analysis hardware implementation of the K-means and K-means++ clustering algorithms which are used to cluster similar data into groups and compare the cluster forming approach of both algorithms. Both algorithms use a state machine to process 3-dimensional data vectors and place them into a meaningful cluster after a fixed number of iterations and indicate final processed cluster data vectors by generating an interrupt. These algorithms are implemented in the Verilog hardware description language. Synthesis of both the ASIC design is done with Xilinx ISE Design Suite 13.4 using 45nm and 90nm technology nodes to calculate the gate count, power, delay, and area. Finally, the result of this research shows that the hardware implementation of K-means ++ puts the data vectors more meaningfully than K-means but at the cost of extra hardware.
dc.description.departmentElectrical and Computer Engineering
dc.format.extent73 pages
dc.subject.classificationComputer engineering
dc.subject.classificationComputer science
dc.subject.classificationInformation technology
dc.subject.classificationInformation science
dc.titleHardware Implementation and Performance Evaluation of K-Means and K-Means++ Clustering Algorithms
thesis.degree.departmentElectrical and Computer Engineering
thesis.degree.grantorUniversity of Texas at San Antonio
thesis.degree.nameMaster of Science


Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
3.75 MB
Adobe Portable Document Format