I need a little bit aid. I want to see each of the elements of the rdd (rddseparar) The idea is to count the words of a text, eliminating the special characters and this is one of de steps for get it
import re
fileName = "/databricks-datasets/cs100/lab1/data-001/shakespeare.txt"
rdd = sc.textFile(fileName)
separar = re.split(r"[^A-Za-z\s\d]", rdd.collect()[0])
separarPalabras = [word for frase in separar for word in frase.split()]
rddseparar = sc.parallelize(separarPalabras)
print(rddseparar.collect())
When I run the code, I should be able to see each of the elements in the rddseparate, but I don't.
Spark code execution output
Why can't I see the elements of the rddseparar ?
(2) Spark Jobs ['1609']