No Login Data Private Local Save

Sitemap URL Extractor - Online Get All Links from XML Sitemap

5
0
0
0

Sitemap URL Extractor

Extract all URLs from any XML Sitemap instantly. Parse sitemap index files, view metadata, and export results.

Quick examples: /sitemap.xml /sitemap_index.xml /post-sitemap.xml
You can also paste sitemap index XML

Click to upload or drag & drop

Supports .xml and .txt files

Frequently Asked Questions

A Sitemap URL Extractor is a tool that parses XML sitemap files and extracts all URLs contained within them. It reads the standard <urlset> and <sitemapindex> XML formats used by websites to list their pages for search engines, and outputs a clean, organized list of all links along with optional metadata like last modification date, change frequency, and priority.

You have three options:
1. Sitemap URL: Enter the full URL of an XML sitemap (e.g., https://yoursite.com/sitemap.xml) and click "Extract URLs". Note that due to browser CORS policies, some websites may block direct access. In that case, use one of the other methods.
2. Paste XML: Copy the raw XML content of a sitemap and paste it directly into the text area, then click "Parse XML". This always works regardless of CORS.
3. Upload File: Upload a saved .xml or .txt sitemap file from your computer.

Browsers enforce Cross-Origin Resource Sharing (CORS) for security reasons. When you try to fetch a sitemap from a different domain via JavaScript, the target server must include appropriate Access-Control-Allow-Origin headers. Many websites don't configure these headers for their sitemap files. Solutions: Use the "Paste XML" or "Upload File" tabs instead — these work 100% of the time without any CORS restrictions. Alternatively, if you're the website owner, you can configure your server to allow CORS for sitemap files.

A Sitemap Index file (root element <sitemapindex>) contains references to multiple individual sitemap files. Large websites use sitemap indexes to split their URL listings across several files (e.g., one for posts, one for pages, one for products). Each entry in a sitemap index points to a child sitemap URL. This tool detects sitemap indexes and lists all child sitemap references, which you can then fetch individually.

Common sitemap locations include:
• /sitemap.xml
• /sitemap_index.xml
• /sitemap-index.xml
• Check your robots.txt file (e.g., https://yoursite.com/robots.txt) — it often contains a Sitemap: directive.
• If you use WordPress with Yoast SEO, check /sitemap_index.xml.
• For Shopify stores, it's typically /sitemap.xml.

Standard XML sitemaps may include these optional fields for each URL:
• Last Modified (lastmod): The date the page was last updated.
• Change Frequency (changefreq): How often the page is expected to change (always, hourly, daily, weekly, monthly, yearly, never).
• Priority: A value from 0.0 to 1.0 indicating the page's importance relative to other pages on the site.
This tool extracts and displays all available metadata alongside each URL.

Yes! You can:
• Copy All: Copies all URLs to your clipboard (one per line).
• Export CSV: Downloads a CSV file with all columns (URL, Last Modified, Change Frequency, Priority) — perfect for spreadsheet analysis.
• Export TXT: Downloads a plain text file with one URL per line — great for feeding into other tools or scripts.

The tool runs entirely in your browser, so the limit depends on your device's memory. In practice, it comfortably handles sitemaps with 50,000+ URLs. For very large sitemaps, the results are paginated (100 URLs per page) to ensure smooth performance. The XML parsing uses the browser's native DOMParser, which is highly optimized. Sitemap index files are also fully supported — the tool will list all child sitemaps for further processing.