Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How do I get the term frequency and the number of co-occurrences of two terms? #48

Open
clark874 opened this issue Jul 19, 2022 · 1 comment
Assignees
Labels

Comments

@clark874
Copy link

Hi John, another question for you:)

In my study, I tried to use one-mode graphs of terms to demonstrate that some terms are more important than others. I processed the graphml file exported by textnets through networkx. I can get the degree of the node and the value of the weight of the edge. According to the documentation, the latter is obtained by tf-idf calculation and is a floating point value.

My question is: Is there a way to get the term frequency of a term in all the documents and the absolute value of the number of co-occurrences of two terms in all the corpus? Both values should be an integer, is it possible to add these value as a COUNT attribute to the node and edge attributes?

Thanks again for your work!

@jboynyc jboynyc self-assigned this Aug 15, 2022
@jboynyc
Copy link
Owner

jboynyc commented Aug 18, 2022

The answer isn't as straightforward as it should be. I'll try to add something like you suggested in the next release. Thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants