Changelog
v0.1.10 (2026-06-07)
Fixed: Development Status classifier set to 4 - Beta (was 3 - Alpha)
v0.1.9 (2026-06-07)
Added: Project homepage link (https://abdulaouwal.com/project/llms-generator/) to pyproject.toml and README.md
Changed: Bumped version from 0.1.8 to 0.1.9
v0.1.7 (2026-06-06)
(This release was published directly to PyPI without a formal changelog entry.)
v0.1.6 (2026-06-06)
Updated: README cleaned up. Removed em dashes and AI-sounding phrases. Added Read the Docs badge and documentation link.
v0.1.5 (2026-06-06)
Added: Changelog URL in pyproject.toml so the changelog link appears on PyPI
v0.1.4 (2026-06-06)
Fixed: Redirected URLs now recorded in llms.txt by their final URL instead of the original pre-redirect URL
Fixed: Link extraction uses the page’s final URL as base for resolving relative links after redirects
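The effect of the base-URL fix can be illustrated with a small sketch. This is not the project's actual code: resolve_links is a hypothetical helper, and the example.com URLs are made up. It assumes the fetcher exposes the post-redirect URL (with requests, that is response.url).

```python
from urllib.parse import urljoin


def resolve_links(base_url: str, hrefs: list[str]) -> list[str]:
    """Resolve each href against base_url (hypothetical helper)."""
    return [urljoin(base_url, href) for href in hrefs]


# Illustrative values: the crawler requested original_url and was
# redirected to final_url.
original_url = "http://example.com/docs"
final_url = "https://example.com/docs/v2/"
hrefs = ["intro.html", "../faq"]

# Resolving against the pre-redirect URL points at the wrong paths.
wrong = resolve_links(original_url, hrefs)

# Resolving against the final URL matches what a browser would do.
right = resolve_links(final_url, hrefs)

print(wrong)
print(right)
```

Relative links are defined against the document's actual location, so the final URL is the only base that yields the targets the page author intended.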
v0.1.3 (2026-06-06)
Fixed: Duplicate page entries in llms.txt caused by trailing-slash variants and http/https scheme variants. URLs are now normalized before deduplication.
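A minimal sketch of normalize-then-deduplicate follows. The function names and the exact rules (force https, lowercase the host, drop the trailing slash and fragment) are assumptions for illustration. The changelog does not spell out the project's actual normalization rules.

```python
from urllib.parse import urlsplit, urlunsplit


def normalize_url(url: str) -> str:
    """Build a comparison key for a URL (assumed rules, not the project's)."""
    parts = urlsplit(url)
    # Treat "/docs/" and "/docs" as the same page; keep "/" for the root.
    path = parts.path.rstrip("/") or "/"
    # Assumption: http and https variants serve the same page, so force https.
    return urlunsplit(("https", parts.netloc.lower(), path, parts.query, ""))


def dedupe(urls: list[str]) -> list[str]:
    """Keep the first URL seen for each normalized key, preserving order."""
    seen: set[str] = set()
    unique: list[str] = []
    for url in urls:
        key = normalize_url(url)
        if key not in seen:
            seen.add(key)
            unique.append(url)
    return unique


pages = [
    "http://example.com/docs/",
    "https://example.com/docs",
    "https://example.com/api",
]
print(dedupe(pages))
```

Comparing on a normalized key while keeping the first-seen URL leaves the recorded entries unchanged apart from the removed duplicates.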
v0.1.2 (2026-06-06)
Fixed: USER_AGENT now reads from __version__. Stays in sync automatically.
Fixed: X-Robots-Tag: nofollow is now respected. Header-level and meta-level directives are merged.
Fixed: Playwright browser instance properly cleaned up on launch failure (resource leak)
Fixed: requests.Session is now explicitly closed after crawl
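One way the merge of header-level and meta-level robots directives could work is sketched below. The helper names are hypothetical and the parsing is deliberately simplified: it ignores user-agent-scoped forms such as "googlebot: nofollow", which a real implementation may need to handle.

```python
from typing import Optional


def parse_directives(value: Optional[str]) -> set[str]:
    """Split a comma-separated directive string into lowercase tokens."""
    if not value:
        return set()
    return {token.strip().lower() for token in value.split(",") if token.strip()}


def may_follow_links(header_value: Optional[str], meta_value: Optional[str]) -> bool:
    """Return False if either source forbids following links.

    header_value: contents of the X-Robots-Tag response header, if any.
    meta_value:   contents of <meta name="robots" content="...">, if any.
    """
    directives = parse_directives(header_value) | parse_directives(meta_value)
    # "none" is shorthand for "noindex, nofollow".
    return not ({"nofollow", "none"} & directives)


print(may_follow_links("nofollow", "index"))
print(may_follow_links(None, "index, follow"))
```

Taking the union means the most restrictive source wins, which is the conservative choice for a crawler.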
v0.1.1 (2026-06-06)
Fixed: robots.txt returning 403/blocked no longer kills the entire crawl. Gracefully falls back to allow-all.
Fixed: --full flag now generates separate llms.txt (summary) and llms-full.txt (full content) as specified
Fixed: URL fragment stripping no longer corrupts paths (str.rstrip replaced with proper split)
Fixed: <h1> text no longer overrides URL-path-based section grouping
Fixed: Playwright fallback no longer triggered on 404/500 errors. Only on empty JS-rendered content.
Optimized: Playwright browser instance reused across all JS fallback fetches
Optimized: HTML parsed once per page instead of three times
Fixed: requirements.txt no longer forces Playwright install
Removed: Dead isinstance(href, (list, tuple)) branch and unused regex
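The fragment-stripping fix in this release comes down to a common str.rstrip pitfall: its argument is a set of characters to strip, not a suffix. The sketch below shows the failure mode and a split-based alternative. The URL and the strip_fragment helper are illustrative, and the exact form of the original buggy call is an assumption.

```python
from urllib.parse import urldefrag

url = "https://example.com/docs/faq#q"

# Pitfall: rstrip("#q") strips any trailing run of '#' and 'q' characters,
# so it also eats the final 'q' of "faq".
corrupted = url.rstrip("#q")


def strip_fragment(target: str) -> str:
    """Drop everything from the first '#' onward (hypothetical helper)."""
    return target.split("#", 1)[0]


clean = strip_fragment(url)

# The standard library offers the same result via urldefrag.
also_clean = urldefrag(url).url

print(corrupted)
print(clean)
```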
v0.1.0 (2026-06-06)
Initial release