File Encoding Checker - Online Detect Text File Charset
Upload a text file to detect its character encoding (UTF-8, ISO-8859-1, etc.) and BOM presence. Runs entirely in your browser.
UD5 Toolkit
Remove invalid characters from file names - Windows, macOS & Linux compatible
Windows prohibits these characters in file names: < > : " / \ | ? * along with ASCII control characters (0-31). DEL (127) is not formally reserved by Windows, but stripping it as well is a safe default. Additionally, file names cannot end with a space or period, and certain reserved device names like CON, PRN, AUX, NUL, COM1-COM9, LPT1-LPT9 are not allowed regardless of extension.
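The character rules above can be sketched in a few lines of Python. This is only an illustration, not this tool's actual implementation: the function name strip_windows_forbidden and the default "_" replacement are assumptions made for the example.

```python
import re

# Punctuation Windows rejects in file names, ASCII control characters 0-31,
# and DEL (127), which the text above mentions alongside them.
_WINDOWS_FORBIDDEN = re.compile(r'[<>:"/\\|?*\x00-\x1f\x7f]')


def strip_windows_forbidden(name: str, replacement: str = "_") -> str:
    """Replace forbidden characters, then drop trailing spaces and periods.

    Hypothetical helper for illustration only.
    """
    cleaned = _WINDOWS_FORBIDDEN.sub(replacement, name)
    cleaned = cleaned.rstrip(" .")  # names may not end with a space or period
    return cleaned or replacement   # never return an empty name
```

Each forbidden character is replaced one-for-one, so the output stays the same length apart from trimmed trailing spaces and periods. Reserved device names need a separate check.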
macOS and Linux are more permissive. The only characters strictly forbidden are / (forward slash) and the null character (\0). However, on macOS, the Finder may interpret : (colon) as a path separator, converting it to / behind the scenes. It's best practice to avoid colons on macOS too.
CON is a reserved device name in Windows dating back to MS-DOS. It refers to the console (keyboard/screen). Other reserved names include PRN (printer), AUX (auxiliary device), NUL (null device), and COM1-COM9 / LPT1-LPT9 (serial/parallel ports). These names are reserved regardless of file extension, so CON.txt or NUL.pdf are also invalid.
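A reserved-name check might look like the following minimal sketch. The helper names is_reserved_windows_name and avoid_reserved are hypothetical, and treating everything after the first period as the extension is a deliberately conservative assumption.

```python
# Reserved device names listed above; Windows compares them case-insensitively.
WINDOWS_RESERVED = (
    {"CON", "PRN", "AUX", "NUL"}
    | {f"COM{i}" for i in range(1, 10)}
    | {f"LPT{i}" for i in range(1, 10)}
)


def is_reserved_windows_name(name: str) -> bool:
    """True if the part before the first period is a reserved device name."""
    stem = name.split(".", 1)[0].upper()
    return stem in WINDOWS_RESERVED


def avoid_reserved(name: str, prefix: str = "_") -> str:
    """Prefix reserved names so they become ordinary file names."""
    return prefix + name if is_reserved_windows_name(name) else name
```

For example, avoid_reserved("CON.txt") gives "_CON.txt", while "console.txt" passes through unchanged because only an exact match on the stem counts.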
On Windows with NTFS, individual file names can be up to 255 characters long. On macOS (APFS/HFS+) the limit is 255 UTF-8 characters. Linux (ext4) allows 255 bytes. The total path length on Windows is traditionally limited to 260 characters (MAX_PATH), though this can be extended in modern Windows versions. This tool offers an optional max-length setting to keep your file names within safe limits.
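One way to apply a max-length setting is to count UTF-8 bytes, which is the strictest of the limits above (the ext4 rule) and therefore a safe common denominator. The sketch below is an assumption about how such a setting could work, not a description of this tool; truncate_name and its max_bytes parameter are hypothetical names.

```python
import os


def truncate_name(name: str, max_bytes: int = 255) -> str:
    """Shorten name to at most max_bytes UTF-8 bytes, keeping the extension."""
    if len(name.encode("utf-8")) <= max_bytes:
        return name
    stem, ext = os.path.splitext(name)
    ext_bytes = ext.encode("utf-8")
    budget = max_bytes - len(ext_bytes)
    if budget <= 0:
        # The extension alone exceeds the limit: truncate the whole name instead.
        stem, ext_bytes, budget = name, b"", max_bytes
    # errors="ignore" drops a multi-byte character that was cut in half.
    short = stem.encode("utf-8")[:budget].decode("utf-8", errors="ignore")
    return short + ext_bytes.decode("utf-8")
```

Truncating on bytes rather than characters means names with accented or CJK characters end up shorter in visible length, but they stay valid on every file system mentioned above.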
Underscores (_) are more universally compatible across all operating systems and programming languages. Hyphens (-) look cleaner in URLs and are preferred for web-facing files. Both are excellent choices. Avoid using spaces in file names intended for web servers, command-line tools, or cross-platform sharing; spaces often require escaping and can break scripts.
Modern operating systems support Unicode characters (including emoji) in file names. This tool preserves Unicode characters and only removes or replaces characters that are explicitly prohibited by the target operating system. However, for maximum compatibility, especially with older software, FTP servers, or cloud storage, you may want to additionally restrict non-ASCII characters.
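If you do want to restrict names to ASCII, one possible approach is to decompose accented letters and replace whatever still falls outside ASCII. This is a sketch under that assumption, separate from what this tool does by default; to_ascii_name is a hypothetical helper.

```python
import unicodedata


def to_ascii_name(name: str, replacement: str = "_") -> str:
    """Reduce a file name to ASCII: strip accents, replace everything else."""
    # NFKD splits accented letters into a base letter plus combining marks.
    decomposed = unicodedata.normalize("NFKD", name)
    out = []
    for ch in decomposed:
        if unicodedata.combining(ch):
            continue  # drop the accent mark, keep the base letter
        out.append(ch if ch.isascii() else replacement)
    return "".join(out)
```

With this sketch, "résumé.pdf" becomes "resume.pdf", while characters with no ASCII base letter (CJK text, emoji) are each replaced by the placeholder.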
Use descriptive, concise names. Stick to letters (a-z, A-Z), numbers (0-9), underscores, hyphens, and single periods for extensions. Avoid special characters, leading hyphens, and overly long names. For date-based files, use YYYY-MM-DD format for natural sorting. Example: project-report_2024-01-15_v2.pdf
Yes! This file name sanitizer is completely free and runs entirely in your browser. No data is uploaded to any server; all processing happens locally on your device, ensuring your file names remain private and secure.
Convert any text into JavaScript-style \uXXXX escape sequences and vice versa. Handles emojis. Useful for i18n development.
Test how non-Latin characters (Chinese, Cyrillic, Arabic) convert to URL-safe slugs with proper transliteration rules. Preview the final string.
Paste text and instantly see if it contains hidden zero-width characters often used in steganography. Reveal invisible payloads.
Paste rich text and clean it to plain text. Normalize line endings and whitespace. Prepare for code or databases.
Paste your CSS and strip all `!important` declarations in one click. See a list of affected rules. Local tool.
Drop a file and see its detected type based on the first bytes (magic number). Identifies hundreds of formats. Local.
Check English spelling and get suggestions using the browser's built-in dictionary. Highlight errors instantly. No data leaves your machine.
Upload multiple text files and concatenate them into one, with optional separators. No upload; processed instantly.
Enter a human name and translate it into a weird alien-sounding version using letter replacement rules.
Convert normal text to Unicode mathematical bold, italic, script, fraktur, and double-struck. Copy rich text for anywhere.
Drop a file to see its MIME type and the first few magic bytes (hex and ASCII). No upload, works instantly.
Practice positive/negative lookahead and lookbehind. See matches highlighted live. Master advanced regex.
Enter a URL and a user-agent to see if it is allowed or blocked by the robots.txt file. Quick bot validation.
Remove the original audio from a video and attach a new soundtrack. Or extract audio. All processed locally.
Paste an enciphered text and instantly see all 25 possible shifts. Highlight the most plausible.
Paste text and quickly highlight potential proper nouns (capitalized sequences). Aid for editing or anonymization. Simple regex-based, local.
Turn your webcam feed into real-time ASCII art. Adjust density and character set. Fun for streams and demos. No upload.
Test different line-break and word-break values on Chinese/Japanese/Korean text. See how browsers wrap. Essential for i18n.
Paste two strings that look the same and see if they differ after normalization. Debug invisible encoding bugs.
Normalize text to any Unicode normalization form to prevent encoding bugs and ensure consistent comparison. Essential for i18n developers.
Create scary, glitched text using combining diacritical marks. Control intensity. Perfect for horror memes and fun. Pure Unicode magic in browser.
Easily apply ROT13 cipher to obfuscate or reveal text. A classic letter substitution cipher that works bidirectionally. Purely client-side processing.
Drop an image that might have the wrong extension and see its real format (JPEG, PNG, WebP) based on header bytes.
Set start and end times for a video and trim it without re-encoding. Download the clip. Pure browser.
Type or paste Korean Hangul and see the Revised Romanization. Also works backwards for basic words. Study aid.
Shows each character's 7-bit or 8-bit binary representation. Includes space separation. For learning binary encoding.
Paste text and detect its language (70+ languages) using a simple character n-gram model implemented in JavaScript. No server communication.
Generate a random first and last name typical for US, UK, or Australia. With optional middle initial. Useful for test data.
Format a list of Q&A pairs and export as a CSV ready for Anki import. Simple line-based template. Local.