Once you're logged in to the compute node, you should set up your cache directories and Apptainer settings:
```bash
# Do this in every session where you're running ollama on Hyak!
# NOTE: the path below is an example; any directory in your scratch space works.
export APPTAINER_CACHEDIR="/gscratch/scrubbed/$USER/apptainer-cache"
```

👉 *If you're following this tutorial, **you should do this every time you're running ollama on Hyak!** This is because the default settings for Apptainer will use your home directory for caching, which will quickly fill up your home directory and cause your jobs to fail. If you are aware of this and have already set `APPTAINER_CACHEDIR`, you can remove the line that sets `APPTAINER_CACHEDIR`.*

### Building the container

Next, you'll need to build the container. (You only need to do this once.) In this example, we'll build the container in a directory in your scratch space. You can change the path to wherever you'd like to build the container.
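Here is a minimal sketch of the build step, assuming the repository's Apptainer definition file is named `ollama.def` (check this repository for the actual file name):

```bash
# Build the container image in your scratch space; adjust the path as needed:
mkdir -p /gscratch/scrubbed/$USER/ollama
cd /gscratch/scrubbed/$USER/ollama
apptainer build ollama.sif ollama.def
```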
By default, the container will install the latest version of `ollama`. If you want to install a specific version, you can specify the version with the `OLLAMA_VERSION` build argument. The most recent version tested with this container is `0.5.8`.
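For example, assuming a recent Apptainer with `--build-arg` support and the `ollama.def` file name from the sketch above:

```bash
apptainer build --build-arg OLLAMA_VERSION=0.5.8 ollama.sif ollama.def
```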
### Starting the `ollama` server
The model files that `ollama` uses are stored by default in your home directory. As these files can be quite large, it's a good idea to store them somewhere else. In this example, we'll store them in your scratch space.
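A minimal sketch using ollama's `OLLAMA_MODELS` environment variable (the path is an example; use your own scratch directory):

```bash
# Tell ollama where to store downloaded model files:
export OLLAMA_MODELS="/gscratch/scrubbed/$USER/ollama-models"
```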
You should run the command above every time you start a new server. If you want to run it automatically every time you log in, you can add it to your `.bashrc` file.
Next, you'll have to start the `ollama` server. You can set the port for the server with the `OLLAMA_PORT` environment variable or leave it unset to use a random port.
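For example, to pin the server to a specific port before starting it (the port number here is arbitrary; any free port works):

```bash
export OLLAMA_PORT=11434
```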
```bash
# Start the ollama server as an Apptainer instance named "ollama-$USER":
# --nv: Use the NVIDIA GPU
# --writable-tmpfs: Use a writable tmpfs for the cache directory
# --bind /gscratch: Bind /gscratch to the container
# NOTE: the command below is a reconstruction based on the comments above
# (an assumption; check this repository for the exact invocation):
apptainer instance start --nv --writable-tmpfs --bind /gscratch ollama.sif ollama-$USER
```
### Running `ollama` commands
To run `ollama` commands, execute the `apptainer run` command with your instance as the first argument and the `ollama` command as the second argument.
For example, to get help with the `ollama` command, run:
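```bash
apptainer run instance://ollama-$USER ollama help
```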
You can start an interactive prompt with the following command:
```bash
apptainer run instance://ollama-$USER ollama run qwen:0.5b
```
Or provide the prompt on the command line and return JSON output non-interactively:
```bash
# NOTE: Not all models support JSON output
# NOTE: Wrap the prompt in single quotes to avoid issues with special characters
apptainer run instance://ollama-$USER ollama run qwen:0.5b --format=json 'Who are you?'
```
For other models, you can replace `qwen:0.5b` with the name of the model you want to use. You can find a list of available models [here](https://ollama.ai/library).
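For instance, to download another model before running it (the model name here is just an example):

```bash
apptainer run instance://ollama-$USER ollama pull llama3.2
```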
To show the models on the server, run:
```bash
apptainer run instance://ollama-$USER ollama list
```
To show the currently running models, run:
```bash
apptainer run instance://ollama-$USER ollama ps
```
To stop the server, run:
```bash
apptainer instance stop ollama-$USER
```
See the [documentation](https://github.com/ollama/ollama) for more information on how to use `ollama`.
#### Listing available models and tags
This repository includes custom commands to list the models and tags available at [ollama.com/library](https://ollama.com/library). These commands are not part of the `ollama` package and are only available in this container. They are useful for finding the names of models and tags to use with the `ollama` command, but they are not guaranteed to work in the future.
To list available models, try the following command with a running instance:
```bash
apptainer run instance://ollama-$USER available-models
```
To list available tags for a model, try:
```bash
# Replace `qwen` with the name of the model you want to check:
apptainer run instance://ollama-$USER available-tags qwen
```