Weblog entry #187 for Steve
The previously described and much improved site search facility has now been made live.
On the one hand this is good; on the other, search is now an external dependency, so the code behaves differently on the live site and on my test installs at home. Ho hum.
Bug reports welcome. I'll set up the spider to reindex the site on a semi-daily basis...
Comments on this Entry
Minor annoyance: The search often returns results ending in "/print". I for one would prefer if you'd not spider those.
BR,
/MBL
[ Send Message | View Steve's Scratchpad | View Weblogs ]
Well spotted, I will remove those.
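A minimal sketch of the kind of filter the spider could apply, in Python for illustration only. The function name `should_index` and the example.org URLs are hypothetical; the only detail taken from the report above is that the unwanted results end in "/print".

```python
from urllib.parse import urlsplit


def should_index(url: str) -> bool:
    """Return False for URLs whose path ends in /print (printer-friendly pages)."""
    # Ignore a trailing slash so ".../print/" is treated like ".../print".
    path = urlsplit(url).path.rstrip("/")
    return not path.endswith("/print")


# Hypothetical crawl queue; only the first URL survives the filter.
urls = [
    "http://example.org/weblogs/187",
    "http://example.org/weblogs/187/print",
]
to_index = [u for u in urls if should_index(u)]
```

Matching on the path rather than on the whole URL means a query string or fragment after "/print" does not hide the page from the filter.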
[ Send Message | View dkg's Scratchpad | View Weblogs ]
The links are all full-URI links, though, which means that when you access the site via https and click on a result, you get taken back to the non-TLS version of the site. Could they be made absolute-path links (i.e. without the protocol and hostname)?
Also, it would be nice to see them integrated with the rest of the site trimming (sidebars, etc). That way, if the search results don't show what you want, you can still get to the other site goodies.
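The link change asked about here amounts to dropping the scheme and hostname from each result URL, so the browser reuses whichever of http or https the visitor is already on. A rough sketch in Python, assuming the results are available as plain URL strings; `to_path_only` is a hypothetical helper, not part of the search software.

```python
from urllib.parse import urlsplit


def to_path_only(url: str) -> str:
    """Strip scheme and hostname, keeping the path, query string and fragment."""
    parts = urlsplit(url)
    result = parts.path or "/"  # a bare hostname maps to the site root
    if parts.query:
        result += "?" + parts.query
    if parts.fragment:
        result += "#" + parts.fragment
    return result
```

For example, `to_path_only("http://example.org/weblogs/187")` gives `"/weblogs/187"`, which resolves against the current page's scheme and host.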
[ Send Message | View Steve's Scratchpad | View Weblogs ]
I hadn't noticed the http vs. https thing - but I guess there isn't a neat solution to that, except to have the 'force SSL' checkbox on and use the advanced login.
(That is because this is a spider-based program, and it spiders only HTTP. If it did both it wouldn't solve the problem, because it would be randomly returning links from http & https).
As for site-integration I'm going to work on improving that. The big issue is that the search results page is static; so I can't easily inject the dynamic side-bars - but I should be able to add the header at least.
[ Send Message | View dkg's Scratchpad | View Weblogs ]
[ Send Message | View Steve's Scratchpad | View Weblogs ]
Not too easily. I'd have to have the main CGI script which powers the site invoke the search script as another CGI, and then marshal stuff back and forth.
For marginal gain I'm not sure the effort is worthwhile.
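To give a feel for the marshalling involved, here is a rough Python sketch of one CGI wrapping another. It is an illustration under assumptions, not how the site works: `run_search_cgi`, `split_cgi_response` and the script path passed in are all hypothetical, and the search script is assumed to take its query via the standard `QUERY_STRING` variable.

```python
import os
import subprocess


def split_cgi_response(raw: bytes) -> tuple[dict[str, str], bytes]:
    """Split raw CGI output into (headers, body) at the first blank line."""
    head, sep, body = raw.partition(b"\n\n")
    crlf_head, crlf_sep, crlf_body = raw.partition(b"\r\n\r\n")
    # Use whichever blank-line separator occurs first in the output.
    if crlf_sep and (not sep or len(crlf_head) < len(head)):
        head, body = crlf_head, crlf_body
    headers = {}
    for line in head.decode("latin-1").splitlines():
        name, _, value = line.partition(":")
        if name.strip():
            headers[name.strip().lower()] = value.strip()
    return headers, body


def run_search_cgi(script_path: str, query_string: str) -> tuple[dict[str, str], bytes]:
    """Run a (hypothetical) search CGI as a child process and capture its reply."""
    env = dict(
        os.environ,
        GATEWAY_INTERFACE="CGI/1.1",
        REQUEST_METHOD="GET",
        QUERY_STRING=query_string,
    )
    result = subprocess.run([script_path], env=env, capture_output=True, check=True)
    return split_cgi_response(result.stdout)
```

The calling script would then discard the child's headers and place the returned body inside its own page template, which is the extra work being weighed against the gain.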
[ Send Message | View dkg's Scratchpad | View Weblogs ]
[ Send Message | View Steve's Scratchpad | View Weblogs ]
Good catch. I'll update the spider.