Basic ideas to solve Spark OOM: Count all the high-frequency words in a big table #48

guotong1988 · 2021-03-19T01:56:41Z

My question in detail:

I want to count all the high-frequency words in a big table.

I split the sentence in each row into words, flatMap so that there is one word per row, group by word, and then count the number of words in each group.

The job then runs out of memory (OOM).
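To make the pipeline described above concrete, here is a minimal sketch in plain Python (not the Spark API) that models the table as a list of partitions. It is only an illustration under stated assumptions: whitespace tokenization, the function names, the sample sentences, and the `min_count` threshold are all hypothetical and do not come from the original job. The first function mirrors "group, then count", which holds every occurrence of a word in memory before counting. The second keeps one running count per word in each partition and merges those small partial results, which is the general idea behind map-side aggregation. Whether this is what causes the OOM in the real job is an assumption, since the post does not say which Spark API is used.

```python
from collections import Counter
from typing import Dict, List


def split_words(sentence: str) -> List[str]:
    # Assumption: words are separated by whitespace; the post does not
    # say how sentences are actually tokenized.
    return sentence.lower().split()


def count_by_grouping(rows: List[str]) -> Counter:
    # Mirrors "flatMap, group by word, count each group": every single
    # occurrence is stored in its group before anything is counted, so
    # memory grows with the total number of words in the table.
    groups: Dict[str, List[int]] = {}
    for sentence in rows:
        for word in split_words(sentence):
            groups.setdefault(word, []).append(1)
    return Counter({word: len(ones) for word, ones in groups.items()})


def count_by_partial_aggregation(partitions: List[List[str]]) -> Counter:
    # Keeps only one integer per distinct word in each partition, then
    # merges the per-partition counts into a single total.
    total: Counter = Counter()
    for partition in partitions:
        partial: Counter = Counter()
        for sentence in partition:
            partial.update(split_words(sentence))
        total.update(partial)
    return total


def high_frequency(counts: Counter, min_count: int) -> Dict[str, int]:
    # Hypothetical definition of "high frequency": at least min_count uses.
    return {word: n for word, n in counts.items() if n >= min_count}


# Hypothetical sample data standing in for the big table.
partitions = [
    ["spark is fast", "spark runs out of memory"],
    ["memory is limited", "spark spills to disk"],
]
rows = [sentence for partition in partitions for sentence in partition]

grouped = count_by_grouping(rows)
combined = count_by_partial_aggregation(partitions)
frequent = high_frequency(combined, min_count=2)
```

Both functions return the same counts; they differ only in how much intermediate data they hold at once. In this toy model the grouped version stores one list entry per word occurrence, while the partial-aggregation version stores one counter entry per distinct word.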

guotong1988 changed the title ~~Basic ideas to solve Spark OOM?~~ Basic ideas to solve Spark OOM: Count all the high-frequency words in a big table Mar 19, 2021
