soti-media-organizer
v1.0.0
Published
A powerful CLI tool to organize photos and videos based on their creation date and metadata, with support for duplicate detection and customizable directory structure.
Downloads
9
Maintainers
Readme
📸 SotiMediaOrganizer
Organize your media collection with ease! SotiMediaOrganizer is a powerful command-line tool that helps you sort and manage your digital photos and videos based on their creation date and metadata.
✨ Features
- 🗂 Organize photos and videos into a structured directory hierarchy (YYYY/MM/DD)
- 📅 Advanced date extraction from multiple sources, including EXIF data and file metadata
- 🔍 Perceptual hashing for detecting and handling duplicate files
- 🚀 Process multiple source directories simultaneously
- ⚡ Concurrent processing for improved performance
- 🛠 Customizable settings via command-line arguments
- 🔧 Extensive customization for target directory structure
🚀 Installation
Option 1: Using Node.js
Ensure you have Node.js (version 14 or higher) installed on your system.
Clone this repository:
git clone https://github.com/shtse8/SotiMediaOrganizer.git cd SotiMediaOrganizer
Install the required dependencies:
npm install
Option 2: Using Bun (Recommended for better performance)
Install Bun if you haven't already:
curl -fsSL https://bun.sh/install | bash
Clone this repository:
git clone https://github.com/shtse8/SotiMediaOrganizer.git cd SotiMediaOrganizer
Install the required dependencies:
bun install
🔧 Usage
With Node.js
Run SotiMediaOrganizer using the following command structure:
node SotiMediaOrganizer.js -s <source_dirs...> -t <target_dir> [options]
With Bun (Recommended)
Run SotiMediaOrganizer using Bun with the following command structure:
bun run SotiMediaOrganizer.js -s <source_dirs...> -t <target_dir> [options]
Required Arguments
-s, --source <paths...>
: Specify one or more source directories to process-t, --target <path>
: Specify the target directory for organized media
Optional Arguments
-e, --error <path>
: Directory for files that couldn't be processed (default:./error
)-d, --duplicate <path>
: Directory for duplicate files (default:./duplicate
)--debug <path>
: Directory for storing all files in duplicate sets for debugging-w, --workers <number>
: Number of concurrent workers (default: 5)-m, --move
: Move files instead of copying them (default: false)-r, --resolution <number>
: Resolution for perceptual hashing (default: 64)--frame-count <number>
: Number of frames to extract from videos for perceptual hashing (default: 5)-h, --hamming <number>
: Hamming distance threshold for perceptual hashing (default: 10)-f, --format <string>
: Format for target directory structure (default:{D.YYYY}/{D.MM}/{D.DD}/{NAME}.{EXT}
)
Examples
Basic usage with one source directory (using Bun):
bun run SotiMediaOrganizer.js -s ~/Pictures/Unsorted -t ~/Pictures/Organized
Multiple source directories with custom error and duplicate folders (using Bun):
bun run SotiMediaOrganizer.js -s ~/Downloads ~/Desktop/Photos -t ~/Pictures/Organized -e ~/Pictures/Errors -d ~/Pictures/Duplicates
Increase the number of concurrent workers for faster processing (using Bun):
bun run SotiMediaOrganizer.js -s ~/Pictures/Unsorted -t ~/Pictures/Organized -w 10
Customize the directory structure with format strings:
bun run SotiMediaOrganizer.js -s ~/Pictures/Unsorted -t ~/Pictures/Organized -f "{D.YYYY}/{D.MM}/{D.DD}/{NAME}.{EXT}"
📋 How It Works
Stage 1: File Discovery
The tool scans the specified source directories for supported image and video files. It discovers files recursively and logs the progress, allowing concurrent processing of directories.
Stage 2: Deduplication
For each file, the tool attempts to extract a unique identifier (hash) and a perceptual hash for images and videos. Files are compared to identify duplicates using a combination of exact hashing and perceptual hashing with customizable hamming distance thresholds.
Stage 3: File Transfer
Files are then moved or copied to the target directory, organized into a customizable folder structure. Duplicate files are handled intelligently, with options to move them to a specified directory or keep the best version.
Date Extraction Process
The tool uses a comprehensive approach to extract the most accurate creation date for each file:
- EXIF Data: Attempts to read EXIF metadata using the ExifTool library. It prioritizes tags such as
DateTimeOriginal
,CreateDate
, andMediaCreateDate
. - File Metadata: If EXIF data is unavailable or invalid, it uses the file's last modified date.
- Fallback: If all above methods fail, the file is moved to an error directory.
Directory Structure Customization
The target directory structure is highly customizable using format strings. Here are some examples of format placeholders:
{D.YYYY}
- Year from mixed date (image or file date){D.MM}
- Month from mixed date{D.DD}
- Day from mixed date{NAME}
- Original filename without extension{EXT}
- File extension{RND}
- Random 8-character hexadecimal string{CAM}
- Camera model{HAS.GEO}
- 'GeoTagged' or 'NoGeo'{HAS.DATE}
- 'Dated' or 'NoDate'
Example format strings:
{D.YYYY}/{D.MM}/{D.DD}/{NAME}.{EXT}
{HAS.GEO}/{HAS.CAM}/{D.YYYY}/{D.MM}/{NAME}_{D.HH}{D.mm}.{EXT}
{TYPE}/{CAM}/{D.YYYY}/{D.MM}/{D.DD}_{D.HH}{D.mm}_{NAME.U}.{EXT}
🥇 Why Choose SotiMediaOrganizer?
SotiMediaOrganizer stands out from other media organization tools for several reasons:
- Comprehensive Date Extraction: The tool uses a multi-step process to extract dates, ensuring more accurate organization even for files with missing or corrupt metadata.
- Duplicate Handling: It intelligently manages duplicates using exact and perceptual hashing, preserving the best files.
- Customization: The directory structure and file handling are highly customizable to fit specific organizational needs.
- Performance: The tool utilizes concurrent processing and supports Bun for improved performance.
- Wide Format Support: It handles a broad range of image and video formats, making it suitable for diverse media collections.
- Non-Destructive: The tool moves files rather than copying them, preserving storage space and maintaining the original files' integrity.
- Open Source: Transparent and customizable, unlike many proprietary solutions.
🖼 Supported File Types
SotiMediaOrganizer supports a wide range of file extensions:
Image Formats
.jpg
,.jpeg
,.png
,.gif
,.webp
,.tif
,.tiff
,.bmp
,.heic
,.heif
,.avif
- RAW formats:
.cr2
,.cr3
,.nef
,.arw
,.dng
, and others
Video Formats
.mp4
,.m4v
,.mov
,.avi
,.mpg
,.mpeg
,.wmv
,.webm
, and others
🤝 Contributing
Contributions, issues, and feature requests are welcome! Feel free to check the issues page.
📜 License
This project is licensed under the MIT License. See the LICENSE file for details.
🙏 Acknowledgements
- ExifTool for EXIF data extraction
- Commander.js for command-line argument parsing
- Bun for providing a fast JavaScript runtime alternative
Happy organizing! 📸✨