Questions tagged [elki]

ELKI is open-source data mining software focused on cluster analysis and outlier detection. In contrast to most other tools, it supports index structures, which it uses to accelerate these algorithms.

164 questions
3 votes
1 answer

ELKI DBSCAN : How to set dbc.parser?

I am running DBSCAN clustering, and besides latitude and longitude I have one more column that I want to see alongside the cluster results. For example, the data looks like this: 28.6029445 77.3443552 1 28.6029511 77.3443573 2 28.6029436 77.3443458 …
sau
3 votes
1 answer

ELKI DBSCAN R* tree index

In MiniGUI, I can see db.index. How do I set it to tree.spatial.rstarvariants.rstar.RStarTreeFactory via Java code? I have…
sau
3 votes
2 answers

Running DBSCAN in ELKI

I am trying to cluster some geospatial data, and I previously tried the WEKA library. I found this benchmark and decided to try ELKI. Despite the advice not to use ELKI as a Java library (which is supposed to be less maintained than the UI), I…
doublebyte
3 votes
2 answers

How can I use the index-structures in ELKI?

These are quotes from http://elki.dbs.ifi.lmu.de/ : "Essentially, we bind the abstract distance query to a database, and then get a nearest neighbor search for this distance. At this point, ELKI will automatically choose the most appropriate kNN…
3 votes
1 answer

Writing ELKI DBSCAN convex hull of clusters to file

I have started using ELKI for data analysis, but one seemingly simple thing I cannot do is output the calculated convex hull of clusters to a file after running DBSCAN. I am able to visualize the convex hulls via the visualization GUI, but…
whistler
2 votes
0 answers

Visualization results of dbscan using ELKI

I am using ELKI's implementation of DBSCAN to cluster. Could you tell me how I can visualize this data? DatabaseConnection dbc = new ArrayAdapterDatabaseConnection(data); Database db = new StaticArrayDatabase(dbc, null); db.initialize(); …
2 votes
1 answer

DBSCAN: How to Cluster Large Dataset with One Huge Cluster

I am trying to perform DBSCAN on 18 million data points, so far just 2D but hoping to go up to 6D. I have not been able to find a way to run DBSCAN on that many points. The closest I got was 1 million with ELKI and that took an hour. I have used…
gregory
2 votes
1 answer

Parallel DBSCAN in ELKI

Here I can see that the class clustering.gdbscan.parallel.ParallelGeneralizedDBSCAN exists, but when I tried to invoke it, I got an error: java -cp elki.jar de.lmu.ifi.dbs.elki.application.KDDCLIApplication -algorithm…
Slowpoke
2 votes
1 answer

ELKI DBSCAN Ignore Columns

I have a CSV file with multiple columns where the columns are "lat, lon, item1". I have been able to load the data using the following code: ListParameterization params = new ListParameterization(); List filterlist = new…
SpiltMilk
2 votes
1 answer

XMeans ELKI fails at every third input file

I'm trying to cluster image data (stored in 100 separate CSV files) with ELKI's XMeans algorithm. It works well for the first two files, but then the algorithm hangs forever while processing the third file. It looks like the problem occurs at…
Charlie28000
2 votes
1 answer

kmeans with sparse vectors in elki

When I try this method with dense vector data, it runs correctly, but with sparse vector data it throws java.lang.ArrayIndexOutOfBoundsException. What data source can I use to read sparse vector data correctly? public void runKmeans(double[][] data)…
Wesin Alves
2 votes
2 answers

Density Based Clustering with Representatives

I'm looking for a method to perform density based clustering. The resulting clusters should have a representative unlike DBSCAN. Mean-Shift seems to fit those needs but doesn't scale enough for my needs. I have looked into some subspace clustering…
Milan
2 votes
2 answers

ELKI OPTICS pre-computed distance matrix

I can't seem to get this algorithm to work on my dataset, so I took a very small subset of my data and tried to get it to work, but that didn't work either. I want to input a precomputed distance matrix into ELKI, and then have it find the…
2 votes
1 answer

How to identify my objects in ELKI DBSCAN results?

I'm using the ELKI GUI to run the DBSCAN algorithm. My input is a CSV file. I create a projection as feature selection: -dbc.filter transform.ProjectionFilter -projection NumericalFeatureSelection -projectionfilter.selectedattributes 1,2 ELKI gives me…
Omid Ebrahimi
2 votes
1 answer

ELKI, the DBOutlierDetection algorithm: What measure is d?

Can you tell me what unit the parameter d is measured in for the DBOutlierDetection algorithm (or DBOutlierScore)? Centimetres? Millimetres? I have to somehow compare the area under parameter d with LOF's k.
limoan