Error when running a Spark program with Python in PyCharm
Problem description:
I wrote a Python file named Wordcount.py in PyCharm. Here are the contents of Wordcount.py:
import sys, os
from pyspark import SparkContext
sc = SparkContext()
myrdd = sc.textFile("passwd")
myrdd.count()
When I run it, an error appears in the console. Here is the error message:
/usr/local/bin/python3 /home/plters/PycharmProjects/Spark21/Wordcount.py
Traceback (most recent call last):
  File "/home/plters/PycharmProjects/Spark21/Wordcount.py", line 2, in <module>
    from pyspark import SparkContext
  File "/opt/spark2/python/pyspark/__init__.py", line 44, in <module>
    from pyspark.context import SparkContext
  File "/opt/spark2/python/pyspark/context.py", line 29, in <module>
    from py4j.protocol import Py4JError
ImportError: No module named 'py4j'
What should I do?
Answer
It looks like the py4j module is missing. Install it from a terminal:
pip install py4j
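
If pip installs into a different interpreter than the /usr/local/bin/python3 that PyCharm runs, the import can still fail after the install. One possible workaround is to use the py4j source zip that Spark distributions ship under python/lib. Below is a minimal sketch of that idea, not a definitive fix. It assumes Spark lives at /opt/spark2 (taken from the paths in the traceback) unless SPARK_HOME says otherwise, and the helper names find_bundled_py4j and ensure_py4j are hypothetical, made up for this illustration.

```python
import glob
import importlib
import importlib.util
import os
import sys


def find_bundled_py4j(spark_home):
    """Return the py4j source zips found under <spark_home>/python/lib, sorted."""
    pattern = os.path.join(spark_home, "python", "lib", "py4j-*-src.zip")
    return sorted(glob.glob(pattern))


def ensure_py4j(spark_home):
    """Try to make py4j importable; return True if it is importable afterwards.

    An already installed py4j (for example from pip) is left alone.
    Otherwise the zip bundled with Spark, if any, is put on sys.path.
    """
    if importlib.util.find_spec("py4j") is not None:
        return True
    zips = find_bundled_py4j(spark_home)
    if not zips:
        return False
    sys.path.insert(0, zips[-1])
    importlib.invalidate_caches()
    return importlib.util.find_spec("py4j") is not None


if __name__ == "__main__":
    # Assumption: /opt/spark2 is the Spark directory seen in the traceback.
    spark_home = os.environ.get("SPARK_HOME", "/opt/spark2")
    print("py4j importable:", ensure_py4j(spark_home))
```

Calling ensure_py4j before the "from pyspark import SparkContext" line in Wordcount.py would let the import find py4j without a separate install, provided the zip exists in that Spark directory.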