| > httrack failed to mirror a site, in the log it says:
> Warning: File not parsed, looks like binary:....
> under what situations would httrack flag this warning?
Httrack always checks quicky how the html file 'looks
like'. If there are too many 'low characters' (for example,
NULL characters, or most ascii < 32), the file is
considered as 'binary'
This prevent the engine from scanning gif files,
incorrectly tagged by bogus servers as 'text/html'.
The ceil might be too strict, however - can you give the
bogus URL? | |