Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Getting non-overlapping sets under the same cluster #6

Open
ale0xb opened this issue Jun 17, 2020 · 0 comments
Open

Getting non-overlapping sets under the same cluster #6

ale0xb opened this issue Jun 17, 2020 · 0 comments

Comments

@ale0xb
Copy link

ale0xb commented Jun 17, 2020

Hello,

I've been playing around with HierarchicalSets and some toy data (see attachment).

The data is loaded from a JSON file and fed into format_sets() like this:

data <- fromJSON(file = "data.json")

universe <- unique(unlist(data))

x <- lapply(data, function(set) {
  universe %in% set
})

setNames = names(x)

colnames(sets) <- setNames
rownames(sets) <- universe

formatted_set <- format_sets(sets)
dataSet <- create_hierarchy(formatted_set)

Then I check the cluster results with:
cluster_members(keySet)

Looking at the output, I see that the algorithm clusters together "el-1272" and "el-2132". However, they do not have any elements in common.

Is this expected behavior? If I understood correctly, this should not happen as the homogeneity between these two sets will always be 0.

Can anyone help me to find out what's going on? Thanks

data.json.zip

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant