Installing a Jupyter Notebook on a Linode Behind an Apache Reverse Proxy
Traducciones al EspañolEstamos traduciendo nuestros guías y tutoriales al Español. Es posible que usted esté viendo una traducción generada automáticamente. Estamos trabajando con traductores profesionales para verificar las traducciones de nuestro sitio web. Este proyecto es un trabajo en curso.
Jupyter Notebook is an interactive, enhanced shell that can be run within a web browser. Notebook is popular among data scientists, and supports inline rendering of figures, exporting to a variety of formats, and LaTeX for mathematical notation. This guide aims to configure on a Linode a public Jupyter Notebook server that will facilitate remote access to your computation needs using Apache as a reverse proxy.
Before You Begin
Because this guide is written for Linodes running Ubuntu 16.04, you should:
If you have not already done so, create a Linode account and Compute Instance. See our Getting Started with Linode and Creating a Compute Instance guides.
Follow our Setting Up and Securing a Compute Instance guide to update your system. You may also wish to set the timezone, configure your hostname, create a limited user account, and harden SSH access.
Install Anaconda Package Manager
Anaconda is a package manager with built-in support for virtual environments. It comes with each installation of Jupyter Notebook and is recommended by Jupyter’s official documentation.
SSH into your Linode and install the latest version of Anaconda. The example below downloads the version of Anaconda with Python 3.6 (but Python 2.7 is also available):
wget https://repo.continuum.io/archive/Anaconda3-4.4.0-Linux-x86_64.sh
Run the installation script:
bash ~/Anaconda3-4.4.0-Linux-x86_64.sh
Follow the prompts in the terminal, accept the terms, and allow the installer create a PATH in
.bashrc
.Reload the new
.bashrc
changes with:exec bash
Create a Self-Signed Certificate
The official documentation recommends generating a self-signed SSL certificate to prevent sending unencrypted passwords in the Notebook from the browser. This is especially important because Jupyter Notebooks can run bash scripts. If you have a domain name, consider using Certbot rather than a self-signed certificate.
Create a self-signed certificate valid for 365 days:
openssl req -x509 -nodes -days 365 -newkey rsa:2048 -keyout mykey.key -out mycert.pem
This command will create a
mykey.key
andmycert.pem
.Restrict the files to only be read by the owner:
chmod 400 mykey.key chmod 400 mycert.pem
Configure Jupyter Notebook
Generate a new configuration file. This will create a
~/.jupyter
directory:jupyter notebook --generate-config
Create a password for the notebook:
jupyter notebook password
Copy the password from the newly created
jupyter_notebook_config.json
file.Uncomment the following lines in the configuration file:
- File: ~/.jupyter/jupyter_notebook_config.py
1 2 3 4 5 6 7 8
c.NotebookApp.allow_origin = '*' c.NotebookApp.base_url = '/jupyter' c.NotebookApp.certfile = '/absolute/path/to/mycert.pem' c.NotebookApp.ip = 'localhost' c.NotebookApp.keyfile = '/absolute/path/to/mykey.key' c.NotebookApp.open_browser = False c.NotebookApp.password = 'paste_hashed_password_here' c.NotebookApp.trust_xheaders = True
Configure Apache Reverse Proxy
Install Apache 2.4:
sudo apt install apache2
Enable a2enmod:
sudo a2enmod
A prompt will appear with a list of mods for Apache:
Your choices are: access_compat actions alias allowmethods asis auth_basic auth_digest auth_form authn_anon authn_core authn_dbd authn_dbm authn_file authn_socache authnz_fcgi authnz_ldap authz_core authz_dbd authz_dbm authz_groupfile authz_host authz_owner authz_user autoindex buffer cache cache_disk cache_socache cgi cgid charset_lite data dav dav_fs dav_lock dbd deflate dialup dir dump_io echo env expires ext_filter file_cache filter headers heartbeat heartmonitor ident include info lbmethod_bybusyness lbmethod_byrequests lbmethod_bytraffic lbmethod_heartbeat ldap log_debug log_forensic lua macro mime mime_magic mpm_event mpm_prefork mpm_worker negotiation proxy proxy_ajp proxy_balancer proxy_connect proxy_express proxy_fcgi proxy_fdpass proxy_ftp proxy_html proxy_http proxy_scgi proxy_wstunnel ratelimit reflector remoteip reqtimeout request rewrite sed session session_cookie session_crypto session_dbd setenvif slotmem_plain slotmem_shm socache_dbm socache_memcache socache_shmcb speling ssl status substitute suexec unique_id userdir usertrack vhost_alias xml2enc Which module(s) do you want to enable (wildcards ok)?
Enable
mod_proxy
,mod_proxy_http
,mod_proxy_wstunnel
,mod_ssl
, andmod_headers
:proxy proxy_http proxy_https proxy_wstunnel ssl headers
Navigate to the
/etc/apache2/sites-available
directory. Copy the default configuration file then add directives on virtualhost:sudo cp 000-default.conf jupyter.conf
Comment out
DocumentRoot
to allowhttps://your-domain-name/
to redirect ashttps://your-domain-name/jupyter
. The<Location>
directive connects the websocket in order to allow the default kernel to run:- File: /etc/apache2/sites-available/jupyter.conf
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31
<VirtualHost *:443> ServerAdmin webmaster@localhost # DocumentRoot /var/www/html ErrorLog ${APACHE_LOG_DIR}.error.log CustomLog ${APACHE_LOG_DIR}/access.log combined SSLCertificateFile /absolute/path/to/mycert.pem SSLCertificateKeyFile /absolute/path/to/mykey.key SSLProxyEngine On SSLProxyVerify none SSLProxyCheckPeerCN off SSLProxyCheckPeerName off SSLProxyCheckPeerExpire off ServerName localhost ProxyPreserveHost On ProxyRequests Off LogLevel debug ProxyPass /jupyter https://localhost:8888/jupyter ProxyPassReverse /jupyter https://localhost:8888/jupyter RequestHeader set Origin "https://localhost:8888" Redirect permanent / https://your-domain-name/jupyter <Location "/jupyter/api/kernels"> ProxyPass wss://localhost:8888/jupyter/api/kernels ProxyPassReverse wss://localhost:8888/jupyter/api/kernels </Location> </VirtualHost>
Note The/jupyter
url path can have any name as long as it matches the base url path defined in the Jupyter notebook configuration file.Enable the newly created configuration:
sudo a2ensite jupyter.conf
Restart the Apache server:
sudo service apache2 restart
Start the Jupyter Notebook:
jupyter notebook
Run Jupyter Notebook
On your local machine, navigate to
https://your-domain-name/
whereyour-domain-name
is the IP address of your Linode or your selected domain name. If using a self-signed certificate, your browser might require that you confirm a security exception:If Apache is configured properly, Jupyter prompts you to log in:
Create a new notebook using a Python kernel:
The Notebook is ready to run Python code or additional kernels added in the future:
Note this setup is for a single-user only; simultaneous users on the same Notebook may cause unpredictable results. For a multi-user server, consider using JupyterHub instead.
More Information
You may wish to consult the following resources for additional information on this topic. While these are provided in the hope that they will be useful, please note that we cannot vouch for the accuracy or timeliness of externally hosted materials.
This page was originally published on