Aqib Fayyaz
01/08/2022, 1:54 PM
Unable to locate package openjdk-8-jdk. Is the issue because of the base image? For our other Dockerfiles where Spark runs we use Ubuntu 20.04 as the base image, but for Prefect we use the Prefect image as the base. Below is the Dockerfile:
FROM prefecthq/prefect:0.15.6-python3.8
# for spark
ENV JAVA_HOME="/usr/lib/jvm/java-8-openjdk-amd64"
ENV SPARK_HOME="/spark/spark-3.1.2-bin-hadoop3.2/"
ENV PYTHONPATH="/spark/spark-3.1.2-bin-hadoop3.2/python:$PYTHONPATH"
ENV PYSPARK_PYTHON="python3"
ENV PATH="$PATH:/spark/spark-3.1.2-bin-hadoop3.2/bin"
ENV PATH="$PATH:$JAVA_HOME"
ENV PATH="$PATH:$JAVA_HOME/bin"
ENV PATH="$PATH:$JAVA_HOME/jre/bin"
ENV SPARK_LOCAL_IP="127.0.0.1"
WORKDIR /
COPY . /
RUN apt-get update && \
apt-get install -y \
openjdk-8-jdk \
python3-pip
ADD https://downloads.apache.org/spark/spark-3.1.2/spark-3.1.2-bin-hadoop3.2.tgz spark.tgz
RUN mkdir -p spark && \
tar -zxvf spark.tgz -C spark/ && \
rm spark.tgz
# for prefect
RUN pip install feast feast-postgres sqlalchemy google-auth scikit-learn
RUN pip install feast[gcp]
RUN pip install --upgrade google-cloud
RUN pip install --upgrade google-cloud-bigquery
RUN pip install --upgrade google-cloud-storage
WORKDIR /opt/prefect
COPY flow_utilities/ /opt/prefect/flow_utilities/
COPY flow_utilities_bigQ_Datastore/ /opt/prefect/flow_utilities_bigQ_Datastore/
COPY setup.py /opt/prefect/setup.py
COPY .feastignore /opt/prefect/.feastignore
RUN pip install .
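Note on the error above: a minimal sketch of one possible fix, not a confirmed answer from this thread. It assumes the `prefecthq/prefect:0.15.6-python3.8` image is built on a Debian-based `python:3.8-slim` image, whose default apt repositories do not carry `openjdk-8-jdk` (Ubuntu 20.04 does, which would explain why the other images build). Spark 3.1.2 is documented to run on Java 8 or 11, so installing OpenJDK 11 is one option. The package name and `JAVA_HOME` path below are assumptions to check against the actual image, for example by running `cat /etc/os-release` and `apt-cache search openjdk` in a container.

```dockerfile
FROM prefecthq/prefect:0.15.6-python3.8

# Assumption: the base image is Debian-based and offers OpenJDK 11 rather than 8.
ENV JAVA_HOME="/usr/lib/jvm/java-11-openjdk-amd64"
ENV PATH="$PATH:$JAVA_HOME/bin"

# The man-page directory can be missing on slim images, which may make the
# JDK package's post-install step fail; creating it first is harmless.
RUN mkdir -p /usr/share/man/man1 && \
    apt-get update && \
    apt-get install -y --no-install-recommends openjdk-11-jdk-headless && \
    rm -rf /var/lib/apt/lists/*
```

The remaining Spark and Prefect steps from the Dockerfile above would follow unchanged. If Java 8 is strictly required, the JDK would have to come from another source, which this sketch does not cover.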
Anna Geller
Aqib Fayyaz
01/08/2022, 3:15 PM
Aqib Fayyaz
01/08/2022, 3:56 PM
Aqib Fayyaz
01/08/2022, 4:02 PM
Anna Geller
01/10/2022, 6:10 AM
Anna Geller
Aqib Fayyaz
01/10/2022, 9:40 AM
Anna Geller
COPY setup.py /opt/prefect/setup.py
RUN pip install .