.
├── dist
├── docs-archive
├── landing-pages
│ ├── dist
│ ├── site
│ │ ├── assets
│ │ │ ├── icons
│ │ │ └── scss
│ │ ├── content
│ │ │ └── en
│ │ │ ├── blog
│ │ │ ├── community
│ │ │ ├── docs
│ │ │ ├── install
│ │ │ ├── meetups
│ │ │ ├── privacy-notice
│ │ │ ├── roadmap
│ │ │ └── use-cases
│ │ ├── data
│ │ ├── layouts
│ │ ├── static
│ │ │ ├── icons
│ │ │ └── integration-logos
│ │ └── themes
│ │ └── docsy
│ └── src
│ └── js
├── license-templates
└── sphinx_airflow_theme
├── demo
└── sphinx_airflow_theme
Work with the site and documentation requires that your computer be properly prepared. Most tasks can be done by the site.sh script
The following applications must be installed to use the project:
- git
- docker
It is also worth adding SSH keys for the github.com
server to trusted ones. It is necessary to clone repositories. You can do this using following command:
ssh-keyscan -t rsa -H github.com >> ~/.ssh/known_hosts
Debian instalation
To install git on Debian, run the following command:
sudo apt install git -y
To install docker, run the following command:
curl -fsSL https://get.docker.com -o get-docker.sh && sh get-docker.sh
sudo usermod -aG docker $USER
Git must have commit author information configured, run these commands
git config --global user.email '<[email protected]>'
git config --global user.name '<you name>'
The project uses many static checks using fantastic pre-commit. Every change is checked on CI and if it does not pass the tests it cannot be accepted. If you want to check locally then you should install Python3.6 or newer together with pip and run following command to install pre-commit:
pip install -r requirements.txt
To turn on pre-commit checks for commit operations in git, enter:
pre-commit install
To run all checks on your staged files, enter:
pre-commit run
To run all checks on all files, enter:
pre-commit run --all-files
Pre-commit check results are also attached to your PR through integration with Travis CI.
To clone repository from github.com to local disk, run following command
git clone [email protected]:apache/airflow-site.git
git submodule update --init --recursive
In order to run an environment for the project, make sure that you have Docker installed. Then, use the site.sh
script to work with the website in a Docker container.
site.sh
provides the following commands.
build-site Prepare dist directory with landing pages and documentation
preview-landing-pages Starts the web server with preview of the website
build-landing-pages Builds a landing pages
prepare-theme Prepares and copies files needed for the proper functioning of the sphinx theme.
shell Start shell
build-image Build a Docker image with a environment
install-node-deps Download all the Node dependencies
check-site-links Checks if the links are correct in the website
lint-css Lint CSS files
lint-js Lint Javascript files
cleanup Delete the virtual environment in Docker
stop Stop the environment
help Display usage
To add a new blogpost with pre-filled frontmatter, in <ROOT DIRECTORY>/landing-pages/site
run:
hugo new blog/my-new-blogpost.md
That will create a markdown file <ROOT DIRECTORY>/landing-pages/site/content/<LANGUAGE VERSION>/blog/my-new-blogpost.md
with following content:
---
title: "My New Blogpost"
linkTitle: "My New Blogpost"
author: "Your Name"
twitter: "Your Twitter ID (optional, remove if not needed)"
github: "Your Github ID (optional, remove if not needed)"
linkedin: "Your LinkedIn ID (optional, remove if not needed)"
description: "Description"
tags: []
date: "2019-11-19"
draft: true
---
Below frontmatter, put your blogpost content.
When you finish your writing blogpost, remember to remove draft: true
from frontmatter.
To add a new blogpost manually, create a markdown file in <ROOT DIRECTORY>/landing-pages/site/content/<LANGUAGE VERSION>/blog/<filename>.md
.
The filename will also serve as URL for your blogpost.
Then, at the top of the file, add frontmatter in following format:
---
title: "<blogpost title>"
linkTitle: "<blogpost link title>"
author: "<author's name>"
twitter: "<optional - author's Twitter ID>"
github: "<optional - author's Github ID>"
linkedin: "<optional - author's Linkedin ID>"
description: "<short description>"
tags: ["<tag1>", "<tag2>", ...]
date: <date in YYYY-MM-DD format>
---
Below frontmatter, put your blogpost content.
In order to add a new blog post with images, you need to add it in sub-folder of the "blog" folder and name your markdown file "index.md". Images placed in this folder can be referred to directly from the markdown file using this directive:
![Alt text](image.png)
To add a new case study with pre-filled frontmatter, in <ROOT DIRECTORY>/landing-pages/site
run:
hugo new use-cases/my-use-case.md
That will create a markdown file <ROOT DIRECTORY>/landing-pages/site/content/<LANGUAGE VERSION>/use-cases/my-use-case.md
with following content:
---
title: "My Use Case"
linkTitle: "My Use Case"
quote:
text: "Quote text"
author: "Quote's author"
logo: "logo-name-in-static-icons-directory.svg"
draft: true
---
##### What was the problem?
text
##### How did Apache Airflow help to solve this problem?
text
##### What are the results?
text
When you finish your writing blogpost, remember to remove draft: true
from frontmatter.
To add a new case study manually, create a markdown file in <ROOT DIRECTORY>/landing-pages/site/content/<LANGUAGE VERSION>/use-cases/<filename>.md
.
The filename will also serve as URL for the case study.
Then, at the top of the file, add frontmatter in following format:
---
title: "<case study title>"
linkTitle: "<case study link title>"
quote:
text: "<quote text>"
author: "<quote author's name>"
logo: "<logo filename (with extension)>"
---
Below frontmatter, put your blogpost content in following format:
#### What was the problem?
<text>
##### How did Apache Airflow help to solve this problem?
<text>
#### What are the results?
<text>
---
Important - put the logo file in <ROOT DIRECTORY>/landing-pages/site/usecase-logos/icons/
directory. Then, in the frontmatter,
refer to it just by filename.
Example:
Path to logo file: /landing-pages/site/static/usecase-logos/my-case-study.svg
Case study in /landing-pages/site/content//use-cases/my-case-study.md
---
title: "<case study title>"
linkTitle: "<case study link title>"
quote:
text: "<quote text>"
author: "<quote author's name>"
logo: "my-case-study.svg"
---
#### What was the problem?
<text>
##### How did Apache Airflow help to solve this problem?
<text>
#### What are the results?
<text>
In order to add a new integration, add an entry in <ROOT DIRECTORY>/landing-pages/site/static/integrations.json
file,
following the format:
{
"name": "<integration name>",
"url": "<url to docs with integration description>",
"logo": "/integration-logos/<filename with extension>"
}
Integrations are displayed in random order, which might be different on each site reload. To search for your integration, use the search functionality.
Providing an integration logo is optional. However, please take note that integrations with logo are be promoted by being displayed before integrations without a logo.
In order to add an upcoming meetup, find your group in <ROOT DIRECTORY>/landing-pages/site/static/meetups.json
file
and put the meetup's date in following format:
MON, JAN 01, 6:00 PM
If your meetup group isn't on the list, add it following the format of existing entries.
-
Github Action has been configured to automatically publish artifacts for pull requests, so we can preview changes from Github Ui.
-
After downloading the artifacts, unpack the archive and start the local HTTP server, run the following command.
Python 3
python -m http.server --cgi 8000
Python 2.7
python -m SimpleHTTPServer 8000
Building documentation for the Apache Airlfow project also requires Python3.6 with pip and graphviz. You also need to have additional apache/airflow
repository available.
You should install and set up all software from "Working with the project/Prerequisite tasks" section.
The following additional application must be installed to use the project:
- python3.6 or newer
- pip
- graphviz
Debian instatation:
To install graphviz, pip for Debian, run following commands:
sudo apt install graphviz python3-pip -y
You should also add $HOME/.local/bin
to $PATH
, run followin command:
export PATH=$HOME/.local/bin:$PATH;
It is necessary to configure 2 variables that point to directories with repositories and one that describe current Airflow version. The next steps will assume that these variables are available.
AIRFLOW_REPO=$HOME/airflow
AIRFLOW_SITE_REPO=$HOME/airflow-site
AIRFLOW_VERSION=1.10.9
git clone [email protected]:apache/airflow.git "${AIRFLOW_REPO}"
git clone [email protected]:apache/airflow-site.git "${AIRFLOW_SITE_REPO}"
cd "${AIRFLOW_SITE_REPO}" && git submodule update --init --recursive
To release a new documentation, follow these steps:
-
To prepare and insstall Sphinx theme, run following commands:
cd "${AIRFLOW_SITE_REPO}" && bash site.sh build-site cd "${AIRFLOW_SITE_REPO}" && bash site.sh prepare-theme cd "${AIRFLOW_SITE_REPO}/sphinx_airflow_theme" && pip3 install -e .
-
To build documentation, run following commands:
cd "${AIRFLOW_REPO}" && git checkout "${AIRFLOW_VERSION}" cd "${AIRFLOW_REPO}" && pip3 install -e '.[doc]' cd "${AIRFLOW_REPO}/docs/" && bash build.sh
-
Copy generated files from
${AIRFLOW_REPO}/docs/_build/html
to${AIRFLOW_SITE_REPO}/docs-archive/<version>/
mkdir -p "${AIRFLOW_SITE_REPO}/docs-archive/${AIRFLOW_VERSION}" cp -r "${AIRFLOW_REPO}/docs/_build/html/." "${AIRFLOW_SITE_REPO}/docs-archive/${AIRFLOW_VERSION}"
You can also mark the release as the latest stable version:
echo "${AIRFLOW_VERSION}" > "${AIRFLOW_SITE_REPO}/docs-archive/stable.txt"
-
Make a commit with generated documentation only.
cd "${AIRFLOW_SITE_REPO}" && git checkout -b "docs-for-${AIRFLOW_VERSION}" cd "${AIRFLOW_SITE_REPO}" && git add . cd "${AIRFLOW_SITE_REPO}" && git commit -m "Docs for ${AIRFLOW_VERSION}"
-
To send changes to the remote server:
cd "${AIRFLOW_SITE_REPO}" && git push origin "docs-for-${AIRFLOW_VERSION}"
Github Action has been configured to automatically publish artifacts for pull requests, so you can preview changes.
It is recommended to use Github Action to publish changes to the website, but in case of problems it is also possible to publish changes to the website manually.
You should install and set up all software from "Working with the project/Prerequisite tasks" section.
The next steps will assume that these variable are available.
AIRFLOW_SITE_REPO=$HOME/airflow-site
AIRFLOW_SITE_ASF_SITE_REPO=$HOME/airflow-site-asf-site
You need to have the two copy of apache/airflow-site
repository. The first contains master
branch checkoued, the second - asf-site
. To clone repository run following commands:
git clone [email protected]:apache/airflow-site.git "${AIRFLOW_SITE_REPO}"
git clone [email protected]:apache/airflow-site.git "${AIRFLOW_SITE_ASF_SITE_REPO}"
cd "${AIRFLOW_SITE_ASF_SITE_REPO}" && git checkout asf-site
cd "${AIRFLOW_SITE_REPO}" && git submodule update --init --recursive
Known issues:
Git worktree does not work properly with repositories that have submodules. Therefore, do not use this features and make multiple full copies of the repositories.
-
To run build site, run following commnad
cd "${AIRFLOW_SITE_REPO}" && bash site.sh build-landing-pages cd "${AIRFLOW_SITE_REPO}" && bash site.sh build-site
-
Remove all files from
asf-site
branchcd "${AIRFLOW_SITE_ASF_SITE_REPO}" && git ls-files | xargs -P 16 rm -rf
-
Copy new release
cp -vr "${AIRFLOW_SITE_REPO}/dist/." "${AIRFLOW_SITE_ASF_SITE_REPO}"
-
Commit changes
cd "${AIRFLOW_SITE_ASF_SITE_REPO}" && git add . cd "${AIRFLOW_SITE_ASF_SITE_REPO}" && git commit -m "Update - $(date)"
-
Push changess
cd "${AIRFLOW_SITE_ASF_SITE_REPO}" && git push origin asf-site
You must have the software discussed in "Working with the project/Prerequisite tasks" and "How to release new documentation" section installed.
It is necessary to configure 3 variables that point to directories with repositories and one that describe current Airflow version. The next steps will assume that these variables are available.
AIRFLOW_REPO=$HOME/airflow
AIRFLOW_SITE_REPO=$HOME/airflow-site
AIRFLOW_SITE_REPO_ASF_SITE=$HOME/airflow-site
AIRFLOW_VERSION=1.10.9
Run following commands to do a lot of magic in one go.
# Clone repository
git clone [email protected]:apache/airflow.git "${AIRFLOW_REPO}"
cd "${AIRFLOW_REPO}" && git checkout "${AIRFLOW_VERSION}"
git clone [email protected]:apache/airflow-site.git "${AIRFLOW_SITE_REPO}"
cd "${AIRFLOW_SITE_REPO}" && git submodule update --init --recursive
cd "${AIRFLOW_SITE_REPO}" && git checkout -b "docs-${AIRFLOW_VERSION}"
git clone [email protected]:apache/airflow-site.git "${AIRFLOW_SITE_ASF_SITE_REPO}"
cd "${AIRFLOW_SITE_ASF_SITE_REPO}" && git checkout asf-site
# Install Python dependencies for Airflow
cd "${AIRFLOW_REPO}" && git checkout 1.10.9 && pip3 install -e .[doc]
# Build image for site environment
cd "${AIRFLOW_SITE_REPO}" && bash site.sh build-image
# Install themes
cd "${AIRFLOW_SITE_REPO}" && bash site.sh build-site
cd "${AIRFLOW_SITE_REPO}" && bash site.sh prepare-theme
cd "${AIRFLOW_SITE_REPO}/sphinx_airflow_theme/" && pip3 install -e .
# Build docs
cd "${AIRFLOW_REPO}/docs/" && bash build.sh
# Copy docs
rm -rf "${AIRFLOW_SITE_REPO}/docs-archive/${AIRFLOW_VERSION}"
mv "${AIRFLOW_REPO}/docs/_build/html" "${AIRFLOW_SITE_REPO}/docs-archive/${AIRFLOW_VERSION}"
# Set current version as stable
echo "${AIRFLOW_VERSION}" > "${AIRFLOW_SITE_REPO}/docs-archive/stable.txt"
# Create commit
cd "${AIRFLOW_SITE_REPO}" && git add .
cd "${AIRFLOW_SITE_REPO}" && git commit -m 'Docs for ${AIRFLOW_VERSION}'
# Push new documentation
cd "${AIRFLOW_SITE_REPO}" && git push origin origin
Once it is accepted and merged, wait for Github Action build to publish the changes.
You should set following variable:
GCP_ZONE="europe-west3-b"
GCP_INSTANCE_NAME="test-airflow-docs-build"
If you want to create VM on GCP, you could use following command:
gcloud compute instances create "${GCP_INSTANCE_NAME}" \
--custom-memory=32GB \
--custom-cpu=6 \
--zone="${GCP_ZONE}" \
--image-family="debian-10" \
--image-project="debian-cloud"
To connect via SSH and forward local SSH key to VM, forwardd ports from VM run following command:
gcloud beta compute \
ssh \
--zone "${GCP_ZONE}" \
"${GCP_INSTANCE_NAME}" \
-- \
-A \
-L "8000:127.0.0.1:8000" \
-L "3000:127.0.0.1:3000" \
-L "1313:127.0.0.1:1313"
To delete VM, run following command:
gcloud compute instances delete "${GCP_INSTANCE_NAME}" --zone="${GCP_ZONE}"
If you wanna create RAM disk, run following command:
sudo mkdir -p /mnt/ramdisk && sudo mount -t tmpfs -o size=16g tmpfs /mnt/ramdisk
To force Python to use RAM disk, run following command:
rm -rf $HOME/.local/lib/python3.7
mkdir -p $HOME/.local/lib/
mkdir -p /mnt/ramdisk/python3.7
ln -s /mnt/ramdisk/python3.7 $HOME/.local/lib/python3.7
All environment variable used in guide should look as following:
AIRFLOW_REPO=/mnt/ramdisk/airflow
AIRFLOW_SITE_REPO=/mnt/ramdisk/airflow-site
AIRFLOW_SITE_ASF_SITE_REPO=/mnt/ramdisk/airflow-site-asf-site