Estimate temperature values of Large Language Models from semantic similarity of generated text
Is it possible to infer the temperature parameter value used by an LLM from only the generated text?
Probably yes.
Figure: similarity between texts generated at the same temperature (one color per temperature level) for the prompt "What will technology look like in 2050?"
LLM Thermometer uses semantic similarity between multiple responses to estimate temperature:
- Generation: Produce multiple responses from an LLM using the same prompt
- Similarity Analysis: Measure semantic similarity between responses
- Temperature Estimation: Infer temperature based on response diversity
- Higher temperature → More diverse responses (lower similarity)
- Lower temperature → More consistent responses (higher similarity); see the sketch below
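The intuition can be captured in a few lines of Python. This is an illustrative outline only, not the package's actual estimator: it assumes the responses have already been embedded, and the calibration table of (mean similarity, temperature) points measured at known temperatures is hypothetical.

```python
# Illustrative sketch (not the package's actual estimator): map the average
# pairwise similarity of response embeddings to a temperature value.
import numpy as np


def mean_pairwise_similarity(embeddings: np.ndarray) -> float:
    """Average cosine similarity over all distinct pairs of response embeddings."""
    normed = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    sims = normed @ normed.T
    rows, cols = np.triu_indices(len(embeddings), k=1)  # distinct pairs only
    return float(sims[rows, cols].mean())


def estimate_temperature(
    embeddings: np.ndarray,
    calibration: list[tuple[float, float]],  # hypothetical (similarity, temperature) points
) -> float:
    """Pick the calibration temperature whose similarity is closest to the observed one."""
    observed = mean_pairwise_similarity(embeddings)
    return min(calibration, key=lambda point: abs(point[0] - observed))[1]
```

A smoother mapping (interpolation or a fitted regression) would serve the same purpose; a nearest-point lookup just keeps the sketch short.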
The reports, hosted on GitHub Pages, contain experiment metadata, charts, and tables.
```bash
# Set required environment variables
export LLM_API_KEY="your_api_key"
export LLM_BASE_URL="https://api.provider.com/v1"
export EMB_API_KEY="your_embedding_api_key"
export EMB_BASE_URL="https://api.provider.com/v1"
```
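These variables point the tool at OpenAI-compatible endpoints. Purely for illustration (this is not part of the package), the same endpoints can be exercised with the official openai Python client, using the placeholder model names that also appear in the commands below:

```python
# Illustrative only: exercise the OpenAI-compatible endpoints configured above.
import os

from openai import OpenAI

llm = OpenAI(api_key=os.environ["LLM_API_KEY"], base_url=os.environ["LLM_BASE_URL"])
emb = OpenAI(api_key=os.environ["EMB_API_KEY"], base_url=os.environ["EMB_BASE_URL"])

# /chat/completions: one sample at a chosen temperature
chat = llm.chat.completions.create(
    model="model-name",  # placeholder, as in the CLI examples below
    messages=[{"role": "user", "content": "What will technology look like in 2050?"}],
    temperature=0.7,
)
print(chat.choices[0].message.content)

# /embeddings: embed a response for the similarity step
vec = emb.embeddings.create(
    model="embedding-model-name",  # placeholder
    input=[chat.choices[0].message.content],
)
print(len(vec.data[0].embedding))
```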
```bash
# Generate samples
llm-thermometer generate \
  --language-model "model-name" \
  --prompt "What will technology look like in 2050?" \
  --samples 32 \
  --data-dir ./data \
  --temperature 0.7
```
```bash
# Measure semantic similarity
llm-thermometer measure \
  --embedding-model "embedding-model-name" \
  --data-dir ./data

# Generate report
llm-thermometer report \
  --data-dir ./data \
  --docs-dir ./docs
```
```bash
# Or using Makefile...
make generate
make measure
make report
make docs
```
The preferred way to install llm-thermometer is with uv (although you can also use pip).
```bash
# Clone the repository
git clone https://github.com/S1M0N38/llm-thermometer.git
cd llm-thermometer

# Create a virtual environment
uv venv

# Install the package
uv sync
```
If you have a GPU available, you can run both the Language Model and embedding model locally using docker-compose:
```bash
# Set HF_HOME environment variable for model caching
export HF_HOME="/path/to/huggingface/cache"

# Start the models
docker-compose up -d

# Language model will be available at http://localhost:41408
# Embedding model will be available at http://localhost:41409
```
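To drive these local containers with the commands above, the base URLs from the earlier environment variables would simply point at these ports. A minimal sketch, assuming the containers expose the usual OpenAI-compatible `/v1` prefix and enforce no API key locally (verify both against your docker-compose.yml):

```python
# Illustrative only: clients pointed at the locally served models.
from openai import OpenAI

local_llm = OpenAI(api_key="unused", base_url="http://localhost:41408/v1")
local_emb = OpenAI(api_key="unused", base_url="http://localhost:41409/v1")

# Sanity check: list the models each local server is serving.
print([m.id for m in local_llm.models.list().data])
print([m.id for m in local_emb.models.list().data])
```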
- Python 3.12+
- OpenAI-compatible API endpoints (`/chat/completions` and `/embeddings`)
- NVIDIA GPU (for local deployment with docker-compose)
This research project is still in its early stages, and I welcome any feedback, suggestions, and contributions! If you're interested in discussing ideas or have questions about the approach, please start a conversation in GitHub Discussions.
For detailed information on setting up your development environment, understanding the project structure, and the contribution workflow, please refer to CONTRIBUTING.md.