I see the following error when I attempt to import koalas from databricks. I am using pyspark v2.4.5 and I'm able to successfully connect to my Spark cluster. It seems that using python 3.5 and connecting to Databricks Runtime 5.x works. I created a clean virtual environment and installed koalas via conda install -c conda-forge koalas
. I have also attempted to rollback kolas to an earlier version to no avail. Please let me know if I can help provide additional details.
File "C:/...", line 1, in <module>
import databricks.koalas as ks
File "C:\ProgramData\Anaconda3\envs\...\lib\site-packages\databricks\koalas\__init__.py", line 55, in <module>
from databricks.koalas.frame import DataFrame
File "C:\ProgramData\Anaconda3\envs\...\lib\site-packages\databricks\koalas\frame.py", line 78, in <module>
from databricks.koalas.plot import KoalasFramePlotMethods
File "C:\ProgramData\Anaconda3\envs\...\lib\site-packages\databricks\koalas\plot.py", line 22, in <module>
from matplotlib.axes._base import _process_plot_format
ModuleNotFoundError: No module named 'matplotlib.axes'