Missing Dataset #20
Comments
Hi, you can find the datasets here: https://github.com/Gaglia88/sparker/tree/master/python/datasets If you need a specific file, let me know. Regards,
Hi, I am trying to run EntityClusteringTests and Progressive. Could you please help me with this? Regards,
Hi, regarding "outf.txt": it is the output of an entity matching function applied to the pairs of profiles retained after meta-blocking. An example could be: If you look at this file https://github.com/scify/JedAIToolkit/blob/master/src/test/java/org/scify/jedai/entityclustering/TestAllMethods.java you can generate the "outf.txt" file by writing the content of the simPairs variable in the following way
I hope this helps. Regards,
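As a rough illustration of the step described above (this is not the maintainer's snippet, and the real layout of "outf.txt" is not given in this thread), here is a minimal Python sketch. It assumes each line holds one retained pair as `id1,id2,similarity`. The function names `write_sim_pairs` and `read_sim_pairs` and the comma-separated layout are hypothetical and chosen only for the example.

```python
import csv


def write_sim_pairs(pairs, path):
    """Write (id1, id2, similarity) triples to a text file, one pair per line.

    ASSUMPTION: the comma-separated layout is a guess for illustration;
    the actual format expected by the experiments may differ.
    """
    with open(path, "w", newline="") as f:
        writer = csv.writer(f)
        for id1, id2, sim in pairs:
            writer.writerow([id1, id2, sim])


def read_sim_pairs(path):
    """Read the triples back as (int, int, float) tuples."""
    with open(path, newline="") as f:
        return [(int(a), int(b), float(s)) for a, b, s in csv.reader(f)]
```

For example, `write_sim_pairs([(0, 5, 0.83), (1, 7, 0.41)], "outf.txt")` would produce a two-line file, and `read_sim_pairs("outf.txt")` would return the same triples.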
Hi, regarding the "matches.txt" file, I do not remember how it was created.
Dear developers,
In the Experiments package, there are some real applications of the algorithms.
However, I could not find the input files these experiment programs use in your GitHub repository, because most of the paths point to locations on your own computer, such as:
C:/Users/gagli/Desktop/outf.txt,
C:\Users\gagli\Desktop\gt.csv,
C:\Users\gagli\Downloads\syntheticDatasets\syntheticDatasets\10Kprofiles.json,
C:\Users\gagli\Downloads\syntheticDatasets\syntheticDatasets\10KIdDuplicates.json,
C:\Users\gagli\Desktop\gt.csv
Without these files, I have to guess at their structure and therefore cannot understand the programs correctly.
Could you please upload these files to GitHub? That would help me a lot. Thank you very much!