I’ve cleaned up the ht://dig search engine quite a bit, though I still have to hack it to be XHTML 1.1 compliant and to fit in better with the site. After updating the index I got these interesting stats:
./htstat
htstat: Total documents: 11490
htstat: Total words: 1933258
htstat: Total unique words: 17660
The bulk of those pages are from the photolog, which currently holds 5200 photos. The index used to be huge because my URLs weren’t very tidy, and sometimes the same page could be reached 10 different ways. That’s still true to some extent, but I’m working on it. Also, I’ve been working very hard on a new project, but I’m not sure if I’m supposed to talk about it yet.
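On the "same page, 10 different URLs" problem: one common way to shrink an index like this is to normalize every URL to a single canonical form before it gets crawled or counted. Below is a minimal Python sketch of that idea. It is not what this site runs and it does not touch ht://dig at all; the example.com addresses, the ignored parameter names, the index file names, and the choice to drop a leading "www." are all assumptions made up for illustration.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Hypothetical: query parameters that do not change what the page shows.
IGNORED_PARAMS = {"sessionid", "ref"}
# Hypothetical: file names that duplicate the bare directory URL.
INDEX_FILES = ("index.html", "index.php")


def canonicalize(url):
    """Collapse common spelling variants of a URL into one form."""
    parts = urlsplit(url)
    scheme = parts.scheme.lower()
    host = parts.netloc.lower()
    # Assumption: the www and non-www hosts serve the same content.
    if host.startswith("www."):
        host = host[4:]
    path = parts.path or "/"
    # Treat /photos/index.php and /photos/ as the same page.
    for name in INDEX_FILES:
        if path.endswith("/" + name):
            path = path[: -len(name)]
            break
    # Drop ignored parameters and sort the rest so order does not matter.
    kept = sorted(
        (key, value)
        for key, value in parse_qsl(parts.query)
        if key not in IGNORED_PARAMS
    )
    # The fragment (#top and so on) never reaches the server, so drop it.
    return urlunsplit((scheme, host, path, urlencode(kept), ""))


def count_unique(urls):
    """Number of distinct pages once every URL is canonicalized."""
    return len({canonicalize(url) for url in urls})
```

Feeding a list of crawled URLs through `count_unique` and comparing the result with the raw count gives a rough idea of how much of an index is duplicates. Which variants are really the same page depends entirely on the site, so the two constants at the top would need to be adjusted by hand.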