Questions tagged [k-means]

k-means is a clustering algorithm, implemented in popular data science tools. Use this tag for questions related to the k-means clustering algorithm itself, or to its use with the tools that implement it (alongside other tags specific to those tools).

In statistics and data mining, k-means clustering is a method of cluster analysis that aims to partition n observations into k clusters, assigning each observation to the cluster with the nearest mean so as to minimize the within-cluster sum of squared deviations.
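
As a rough illustration of the definition above, here is a minimal pure-Python sketch of the standard iterative procedure (often called Lloyd's algorithm). The function names, the random-sample initialization, the `seed` parameter, and the choice to leave an empty cluster's center unchanged are all assumptions made for this example; library implementations differ in these details.

```python
import random


def squared_distance(p, q):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def mean(points):
    """Coordinate-wise mean of a non-empty list of tuples."""
    n = len(points)
    return tuple(sum(coords) / n for coords in zip(*points))


def kmeans(points, k, max_iter=100, seed=0):
    """Illustrative k-means; returns (centers, labels)."""
    rng = random.Random(seed)
    # Assumption: start from k distinct observations picked at random.
    centers = rng.sample(points, k)
    labels = None
    for _ in range(max_iter):
        # Assignment step: each observation goes to the nearest center.
        new_labels = [
            min(range(k), key=lambda j: squared_distance(p, centers[j]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments are stable, so the centers are too
        labels = new_labels
        # Update step: each center moves to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # assumption: an empty cluster keeps its old center
                centers[j] = mean(members)
    return centers, labels


if __name__ == "__main__":
    data = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
    centers, labels = kmeans(data, k=2)
    print(centers, labels)
```

The result depends on the initial centers, which is why practical implementations usually run several initializations and keep the partition with the lowest within-cluster sum of squares.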

For details, see the Wikipedia entry at http://en.wikipedia.org/wiki/K-means_clustering

3514 questions


2 answers

How to detect multiple objects with OpenCV in C++?

I got inspiration from this answer here, which is a Python implementation, but I need C++. That answer works very well. The idea is: use detectAndCompute to get keypoints, use kmeans to segment them into clusters, then for each cluster do…

asked Sep 20 '18 at 12:37

Suge


2 answers

How to use silhouette score in k-means clustering from sklearn library?

I'd like to use the silhouette score in my script to automatically compute the number of clusters in k-means clustering from sklearn. import numpy as np import pandas as pd import csv from sklearn.cluster import KMeans from sklearn.metrics import…

python-2.7 machine-learning scikit-learn k-means silhouette

asked Jul 02 '18 at 14:40

Jessica Martini


1 answer

Using a smoother with the L Method to determine the number of K-Means clusters

Has anyone tried to apply a smoother to the evaluation metric before applying the L-method to determine the number of k-means clusters in a dataset? If so, did it improve the results? Or allow a lower number of k-means trials and hence much greater…

algorithm cluster-analysis k-means linear-regression

asked Oct 27 '10 at 13:35

winwaed


1 answer

Using dplyr and broom to compute kmeans on a training and test set

I am using dplyr and broom to compute kmeans for my data. My data contains a test and training set of X and Y coordinates and is grouped by some parameter value (lambda in this case): mds.test = data.frame() for(l in seq(0.1, 0.9, by=0.2)) { …

r dplyr k-means broom

asked Oct 18 '16 at 04:40

user2117258


1 answer

initial centroids for scikit-learn kmeans clustering

If I already have a numpy array that can serve as the initial centroids, how can I properly initialize the kmeans algorithm? I am using the scikit-learn KMeans class. This post (k-means with selected initial centers) indicates that I only need to set…

python scikit-learn cluster-analysis k-means

asked Jul 13 '16 at 14:54

webmaker


2 answers

How to identify Cluster labels in kmeans scikit learn

I am learning python scikit. The example given here displays the top occurring words in each Cluster and not Cluster name. http://scikit-learn.org/stable/auto_examples/document_clustering.html I found that the km object has "km.label" which lists…

python machine-learning scikit-learn cluster-analysis k-means

asked Feb 05 '15 at 13:00

vij555


1 answer

What is the difference between SOM (Self Organizing Maps) and K-Means?

There is only one question related to this on Stack Overflow, and it is more about which one is better. I just don't really understand the difference. I mean they both work with vectors, which are assigned randomly to clusters, they both work with the…

artificial-intelligence k-means som self-organizing-maps

asked Apr 02 '13 at 00:31

Oliver Hoffman


8 answers

k-means empty cluster

I am trying to implement k-means as a homework assignment. My exercise sheet gives me the following remark regarding empty centers: During the iterations, if any of the cluster centers has no data points associated with it, replace it with a random data…

k-means

asked Jun 17 '12 at 22:23

toobee


1 answer

How to specify a distance metric for kmeans in R?

I'm doing kmeans clustering in R with two requirements: I need to specify my own distance function, currently the Pearson coefficient. I want to do the clustering that uses the average of group members as centroids, rather than some actual member. The reason for…

r cluster-analysis k-means

asked Sep 23 '11 at 03:51

Derrick Zhang


4 answers

Reading wav file in Java

I want to read wav files in Java and I am going to classify them with K-means. How can I read wav files in Java and assign them into an array or something like that (you can suggest ideas for it) to classify them? EDIT: I want to use APIs for reading…

java audio wav k-means

asked Mar 06 '11 at 11:18

kamaci


1 answer

k-means with selected initial centers

I am trying to do k-means clustering with selected initial centroids. It says here that to specify your initial centers: init : {‘k-means++’, ‘random’ or an ndarray} If an ndarray is passed, it should be of shape (n_clusters, n_features) and gives…

python numpy scikit-learn k-means

asked Mar 04 '15 at 18:40

lel


1 answer

How to Find Documents That are in the same Cluster with KMeans

I have clustered various articles together with the Scikit-learn framework. Below are the top 15 words in each cluster: Cluster 0: whales islands seaworld hurricane whale odile storm tropical kph mph pacific mexico orca coast cabos Cluster 1: ebola…

python artificial-intelligence scikit-learn k-means

asked Sep 14 '14 at 01:45

Stunner


2 answers

k-means return value in R

I am using the kmeans() function in R and I was curious what is the difference between the totss and tot.withinss attributes of the returned object. From the documentation they seem to be returning the same thing, but applied on my dataset the value…

r k-means least-squares

asked Dec 26 '11 at 16:29

Marius


3 answers

Where to find a reliable K-medoid (not k-means) open source software/tool?

I am learning the K-medoids algorithm, so I am sorry if I ask inappropriate questions. As far as I know, the K-medoids algorithm performs a K-means-style clustering but uses actual data points as centroids instead of mathematically calculated means. As I googled…

open-source cluster-analysis k-means

asked Oct 05 '11 at 20:03

Cassie


2 answers

Weka simple K-means clustering assignments

I have what feels like a simple problem, but I can't seem to find an answer. I'm pretty new to Weka, but I feel like I've done a bit of research on this (at least read through the first couple of pages of Google results) and come up dry. I am using…

cluster-analysis data-mining weka k-means

asked Jul 13 '11 at 21:32

machine yearning
