Installing the apache web server on an Arch server is as simple as using the "pacman" package manager.
The workhorse of the Internet
Ah, the apache web server. Used to host more web sites than any other web server out there, apache's popularity stems from its flexibility and stable performance. It also doesn't hurt that there's a lot of documentation for it that's just a web search away.
In this article series we'll cover how to install apache and how to dive into its configuration to tailor the web server to your needs. We'll also throw in some instructions for installing PHP, given its widespread use on web servers these days.
Note that we'll usually refer to the apache web server as just "apache", even though technically apache puts out other projects too. Honestly, we do it for the same reason most of the Internet does it: It's easier to just say "apache".
We start at the beginning.
The first step is really a precautionary one before installing the web server. We'll want to make sure that once the web server is running browsers will actually be able to reach it.
You may be running a firewall on your server, and it might be blocking traffic to the standard web server ports, port 80 (for regular connections) and port 443 (for secure connections). We'll talk about how to check iptables, since it's the firewall method used most often (and because it's what we use in our server setup articles), but the same general principle will apply to any other firewall solution.
Check the current firewall rules, to start off:
sudo /sbin/iptables -L
You'll want to look those rules over to make sure ports 80 and 443 are open for business. If it turns out you don't have a firewall running no changes are necessary. If you're using the default rules from our slice setup articles, you should already find ports 80 and 443 are open. Otherwise you'll want to apply rules to open those ports.
Rules like these should suffice for even the strictest iptables configuration:
-I INPUT -p tcp --dport 80 -m state --state NEW,ESTABLISHED -j ACCEPT -I OUTPUT -p tcp --sport 80 -m state --state ESTABLISHED -j ACCEPT -I INPUT -p tcp --dport 443 -m state --state NEW,ESTABLISHED -j ACCEPT -I OUTPUT -p tcp --sport 443 -m state --state ESTABLISHED -j ACCEPT
Installing a basic apache web server is straightforward:
sudo pacman -Sy sudo pacman -S apache
The first command syncs the package repository, and the second installs the base apache package and support for some commonly-used options (like SSL support).
Now that we have apache installed and working properly, we need to make sure it's set to start automatically if the slice is rebooted.
Edit the file:
Toward the bottom of the file is the "DAEMONS" setting:
# ----------------------------------------------------------------------- # DAEMONS # ----------------------------------------------------------------------- # # Daemons to start at boot-up (in this order) # - prefix a daemon with a ! to disable it # - prefix a daemon with a @ to start it up in the background # DAEMONS=(syslog-ng network netfs crond sshd)
Add "httpd" to the end of the list of services in the "DAEMONS" variable, so it looks more like:
DAEMONS=(syslog-ng network netfs crond sshd httpd)
If you start apache now you might see this warning:
httpd: Could not reliably determine the server's fully qualified domain name, using 127.0.0.1 for ServerName
The warning comes from "httpd" because that's the name of the apache process when it's running (and thus what you'd look for in a ps list or top).
We'll take a closer look at apache configuration later in this article series, but for now let's attend to that complaint.
First, let's create a "custom" directory in apache's configuration directory:
sudo mkdir /etc/httpd/conf/custom
Now create a config file to store your server name:
sudo nano /etc/httpd/conf/custom/servername.conf
Put the following line into that file:
Change "demo" to the name of your slice or another handy identifier. Don't use the domain names of any of your sites for this value. We'll want to use those when we set up virtual hosts later in this series.
When you're done save servername.conf.
Finally, edit the main apache config file:
sudo nano /etc/httpd/conf/httpd.conf
Scroll way down to the end of that file, and add this line:
We'll talk about why we did all that work when we talk about apache's config layout in a later article. For now, save that change to httpd.conf.
Now that you've explicitly told apache what your server name should be, let's gracefully restart Apache:
sudo /usr/sbin/apachectl graceful
You shouldn't see the warning this time. If you do, you may want to double-check your servername.conf file.
We used "apachectl" to restart apache just now. That command can be used to start or stop apache in a manner similar to using the init script (/etc/init.d/httpd). If you just run apachectl by itself you get a list of arguments you can send it:
$ /usr/sbin/apachectl Usage: /usr/sbin/httpd [-D name] [-d directory] [-f file] [-C "directive"] [-c "directive"] [-k start|restart|graceful|graceful-stop|stop] [-v] [-V] [-h] [-l] [-L] [-t] [-S] Options: -D name : define a name for use in
directives -d directory : specify an alternate initial ServerRoot -f file : specify an alternate ServerConfigFile -C "directive" : process directive before reading config files -c "directive" : process directive after reading config files -e level : show startup errors of level (see LogLevel) -E file : log startup errors to file -v : show version number -V : show compile settings -h : list available command line options (this page) -l : list compiled in modules -L : list available configuration directives -t -D DUMP_VHOSTS : show parsed settings (currently only vhost settings) -S : a synonym for -t -D DUMP_VHOSTS -t -D DUMP_MODULES : show all loaded modules -M : a synonym for -t -D DUMP_MODULES -t : run syntax check for config files
...Except it says it's talking about "httpd", not apachectl. You can use the arguments above with apachectl, but it's really httpd running them behind the scenes. The apachectl script is a "wrapper" for httpd, providing the ability to use some more friendly arguments to control apache. It just doesn't list all of those friendly arguments in a friendly way when you ask for them. This is what happens when a script writer doesn't put a help response in their script. Tsk.
Fortunately you can get a friendly list of arguments for apachectl right here: start, stop, restart, graceful, graceful-stop, status, and configtest.
The start, stop, and restart commands are pretty self-explanatory, and often they're all you will need. But let's take a moment to briefly explain the other arguments.
graceful | graceful-stop
The "graceful" argument to apachectl asks apache to restart, but to do it in a way that won't interfere with existing connections. The "graceful-stop" argument is similar to "stop", but apache will let existing connections finish their business before cutting them off. The common thread there is that while the core of the web server is restarted or stopped, processes are left in place to continue handling old connections using the old configuration.
So on paper, "graceful" looks pretty good. When it works, you can restart the web server without actually cutting any users off or interrupting their sessions. In practice, graceful doesn't always execute perfectly. Some modules don't work well with graceful restarts, and sometimes you can end up with several hung connections that won't go away. There are also some configuration changes that only take effect after a full restart.
If you want to use "graceful" to restart apache after most configuration changes, just be aware of the possibility that it won't always work like you'd want it to. Make a test connection to the web server immediately after a graceful restart to make sure it accepts new connections using the new configuration. If graceful restarts don't consistently work, you may be better off just using a regular "restart" command instead (it may not be graceful, but it's more reliable).
The "reload" argument for the /etc/rc.d/httpd script works in a similar way to "graceful", with the same caveats.
The "configtest" argument to apachectl doesn't interfere with a running web server. It simply asks apache to skim its configuration files to check for syntax errors. It's good to use this one after any change.
The configtest won't guarantee that a configuration change will actually work, mind you, but it does let you catch the more obvious configuration problems like a missing bracket or misspelled keyword.
status | fullstatus
The status command can present you with a snapshot of what the web server is doing at a given moment, but it usually takes some preparation to get it to work. Since the output is a bit more complex than just whether or not the web server is running, you may not want to go to the effort.
To get a response from the status or fullstatus commands, you would need to enable mod_status on the web server, configure it, and have a text-based web browser like "lynx" or "links" installed on your slice. If you just try to run "apachectl status", apache will let you know what needs to be done.
For details on how to enable apache's mod_status module and read the output, check this article on the subject after you have your basic apache installation in place.
By default, the apache logs are stored in the directory:
You will need to use sudo to look around in that directory since it's restricted to root access. The two main log files you'll find in there are "access_log" and "error_log".
The "access_log" will store all access attempts apache receives. This can be useful for traffic analysis, but it's also handy for troubleshooting if you need to figure out if a connection attempt got through to the web server (or if it's been blocked by something like iptables).
The "error_log" stores the errors apache reports. This can include both errors reported by the binary or by modules (like PHP complaining about not finding an SQL library) and errors the web server has sent to users (like a "file not found" error).
Test the server
If you navigate to your server's IP address in a web browser:
Arch doesn't install a default index page with apache, so you should see a page with "Index of /" at the top, and what's basically an empty directory listing. If you see that, apache is working. Hooray for working!
If not then, well, it doesn't work. If you didn't see the index page and your browser took a minute to tell you it couldn't get to it (like a "Server unavailable" error), then iptables may be blocking access to port 80.
If your browser gives you an error response right away, like a "connection refused" error, then apache may not be running. Try starting it up to be sure:
If that doesn't work, and you didn't get a useful explanation from apache2ctl, check the error log in apache's log directory. Common problems include a mistake in the /etc/apache/apache2.conf file, or another web server already running and keeping apache from attaching itself to port 80.
You should now have a functioning, but very basic, apache installation running on your server. There's more work to be done to properly configure apache and get your site on there, but you've completed the first step.
The next article in this series describes how to install PHP. If you don't want to install PHP for your web server, you can skip that step and go straight to the article discussing the apache configuration layout.
- -- Jered