imgdown

Note: I built this tool to help me yank files off my CDN into Hugo Page Bundles

A Rust utility that automatically downloads images referenced in text-based files like HTML, Markdown, and CSS documents.

Features

Processes individual files or entire directories recursively
Supports multiple file formats:
- HTML (.html, .htm)
- Markdown (.md)
- CSS (.css)
- Plain text (.txt)
- XML (.xml)
Handles various image formats: JPG, JPEG, PNG, SVG, and WebP
Supports both relative and absolute URLs
Maintains original file structure
Skips existing files to avoid duplicate downloads
Follows symbolic links when scanning directories

Prerequisites

Rust (latest stable version)
Cargo package manager

Dependencies

[dependencies]
tokio = { version = "1.0", features = ["full"] }
reqwest = "0.11"
anyhow = "1.0"
regex = "1.0"
url = "2.0"
walkdir = "2.0"

Installation

Clone the repository:

git clone [repository-url]
cd imgdown

Build the project:

cargo build --release

The compiled binary will be available in target/release/.

Usage

The application can process either a single file or an entire directory:

# Process a single file
./imgdown path/to/file.html

# Process an entire directory
./imgdown path/to/directory

Example

./imgdown ./docs/blog

This will:

Scan all supported files in the ./docs/blog directory
Find image references in these files
Download the images to the same directory structure as their referencing files
Skip any images that have already been downloaded

How It Works

The program accepts a file or directory path as input
For directories, it recursively scans for supported file types
For each file, it:
- Reads the content
- Uses regular expressions to find image references
- Downloads images from valid URLs
- Preserves the directory structure
- Skips existing files

Error Handling

Invalid paths result in appropriate error messages
Download failures are logged but don't stop the process
File access issues are reported with detailed error messages

Limitations

Only processes files with supported extensions
Requires valid URL formatting in source files
Does not validate image content
Does not process JavaScript-generated image references

Contributing

Contributions are welcome! Here are some ways you can contribute:

Report bugs
Suggest new features
Add support for more file types
Improve error handling
Enhance documentation

License

MIT License

Authors

Chris Short chrisshort@duck.com

Acknowledgements

Created using Anthropic Claude 3.5 Sonnet

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
.github/workflows		.github/workflows
src		src
target		target
.gitignore.md		.gitignore.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

imgdown

Features

Prerequisites

Dependencies

Installation

Usage

Example

How It Works

Error Handling

Limitations

Contributing

License

Authors

Acknowledgements

About

Releases 2

Languages

License

chris-short/imgdown

Folders and files

Latest commit

History

Repository files navigation

imgdown

Features

Prerequisites

Dependencies

Installation

Usage

Example

How It Works

Error Handling

Limitations

Contributing

License

Authors

Acknowledgements

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 2

Languages