| HTTrack's filters (scan rules) give you fine-grained control over which URLs
get downloaded.
Include/Exclude URL Patterns:
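+URL_pattern: Accept (download) URLs that match this pattern.
-URL_pattern: Do NOT download URLs that match this pattern.
Rules are read in order and a later rule takes priority over an earlier one,
so put broad exclusions first and the specific inclusions after them.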
Example for Dynamic URLs: If a site appends ?session=123 to its URLs but the
content is the same, you might use a filter like -*?session=* to keep HTTrack
from downloading the same page again for every session ID.
Example for APIs: If you notice API calls like example.com/api/data?id=...
that HTTrack isn't picking up, you might need to add a +example.com/api/*
filter to explicitly allow them. A filter only permits URLs that HTTrack
actually encounters, so this helps only if it finds those links and can do
something useful with the JSON/XML they return.
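To make the two examples above concrete, here is a minimal sketch that assembles an httrack command line with both filters. The helper name build_httrack_command, the start URL, and the ./mirror output directory are assumptions for illustration; the only HTTrack pieces used are the -O output option and filters passed as extra arguments. Nothing is executed, the command is only printed so you can inspect it first.

```python
import shlex


def build_httrack_command(start_url, output_dir, filters):
    """Return an argv list for the httrack CLI (hypothetical helper; runs nothing)."""
    # Filters are passed as ordinary arguments after the URL and options.
    # Order is preserved because later rules take priority over earlier ones.
    return ["httrack", start_url, "-O", output_dir, *filters]


filters = [
    "-*?session=*",        # skip URLs that carry a session ID
    "+example.com/api/*",  # allow links under /api/ (placeholder host)
]

command = build_httrack_command("https://example.com/", "./mirror", filters)

# shlex.join quotes each filter so the shell does not expand * or ?.
print(shlex.join(command))
```

Quoting matters when you type these by hand too: an unquoted * or ? may be expanded by the shell before HTTrack ever sees the rule.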
MIME Type Filters: You can include or exclude content by the type the server
reports (+mime:text/html, -mime:application/pdf). This is useful when a
dynamic script serves different MIME types from similar-looking URLs and you
only want the HTML, for example.
Size Filters: +*[<100] (include files smaller than 100KB) or -*[>1000]
(exclude files larger than about 1MB). Sizes are plain numbers in KB, with no
unit suffix. This can help avoid downloading huge media files you don't need,
which might be dynamically loaded.
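Since the size limit is easy to get wrong, here is a small sketch of a helper that formats size rules. It assumes the size is written as a plain number of KB inside the brackets; size_rule and its parameters are hypothetical names, not part of HTTrack.

```python
def size_rule(include, pattern, comparison, kilobytes):
    """Format an HTTrack-style size rule such as -*[>1000] (hypothetical helper)."""
    if comparison not in ("<", ">"):
        raise ValueError("comparison must be '<' or '>'")
    if kilobytes < 0:
        raise ValueError("kilobytes must not be negative")
    sign = "+" if include else "-"
    # The size is appended as a bare number of KB, with no unit suffix.
    return f"{sign}{pattern}[{comparison}{kilobytes}]"


print(size_rule(True, "*", "<", 100))    # include files under 100 KB
print(size_rule(False, "*", ">", 1000))  # exclude files over roughly 1 MB
```

The resulting strings can be pasted into the scan rules or passed as quoted command-line arguments.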
Scan Rules Tab (GUI): In the WinHTTrack GUI, the "Set Options" -> "Scan Rules"
tab is where you manage these filters. It's an intuitive way to build complex
rules.
| |