[jira] [Created] (ZEPPELIN-2621) Spark standalone Dockerfile downloads corrupt spark.tgz archive

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Created] (ZEPPELIN-2621) Spark standalone Dockerfile downloads corrupt spark.tgz archive

JIRA jira@apache.org
Anne Rutten created ZEPPELIN-2621:
-------------------------------------

             Summary: Spark standalone Dockerfile downloads corrupt spark.tgz archive
                 Key: ZEPPELIN-2621
                 URL: https://issues.apache.org/jira/browse/ZEPPELIN-2621
             Project: Zeppelin
          Issue Type: Bug
    Affects Versions: 0.7.0
            Reporter: Anne Rutten
            Priority: Minor


trying to build a docker image as per [these instructions|https://zeppelin.apache.org/docs/0.7.0/install/spark_cluster_mode.html#1-build-docker-file]

generates the following error:

{quote}
> docker build -t "spark_standalone" .
{quote}
{quote}
(...)
 Step 12 : RUN curl -s http://apache.mirror.cdnetworks.com/spark/spark-$SPARK_VERSION/spark-$SPARK_VERSION-bin-hadoop$HADOOP_PROFILE.tgz | tar -xz -C /usr/local/
 ---> Running in adf4bd575e37

gzip: stdin: unexpected end of file
tar: Unexpected EOF in archive
tar: Unexpected EOF in archive
tar: Error is not recoverable: exiting now
The command '/bin/sh -c curl -s http://apache.mirror.cdnetworks.com/spark/spark-$SPARK_VERSION/spark-$SPARK_VERSION-bin-hadoop$HADOOP_PROFILE.tgz | tar -xz -C /usr/local/' returned a non-zero code: 2
{quote}

I've tracked this down to the .tgz file being downloaded from  http://apache.mirror.cdnetworks.com/ being corrupted. If i substitute with a cloudfront URL, e.g. https://d3kbcqa49mib13.cloudfront.net/,  the image gets built successfully.




--
This message was sent by Atlassian JIRA
(v6.3.15#6346)