PySMBSearch is a crawler and search engine for SMB shares. It consists of a crawler script, which builds an index and stores it in an SQL database, and a CGI script that runs search queries against that database.
I've been adding small features and enhancements to pysmbsearch since the last release (I should really get CVS up and running, so that people don't have to wait for the tarballs). Some things to expect:
* Fixed a bug in the incremental updater that was making it try to index a bunch of nonexistent hosts.
* Updated the name resolving code to run in parallel, rather than serially. This means that a) it runs in about 3 minutes on my PIII 500, rather than 10 minutes, and b) the CPU usage sits at 100% for all 3 minutes. I may introduce a command line switch to choose between the two methods, as both have their advantages.
* Included two scripts I use, timed-index.sh and timed-inc.sh, which do stats collecting after index update.
* Added VACUUM ANALYZE to the full-update script. In Postgres, old, deleted entries don't go away, but rather hang around and slow down queries until you run VACUUM. The ANALYZE helps the Postgres database engine optimize future queries.
A little bit more needs to be cleaned up, but hopefully I can get that done this weekend. That'll bump this up to version 0.3; I think I'm going to then try to do some more major things like full database abstraction for the next version.
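To make the serial versus parallel trade-off in the name resolving change concrete, here is a minimal sketch in Python. It is not the code from pysmbsearch: the function names (`resolve_one`, `resolve_serial`, `resolve_parallel`), the use of a thread pool, and the use of `socket.gethostbyname` as the default lookup are all assumptions made for illustration. The real crawler may resolve names differently (for example via NetBIOS) and may parallelize with processes rather than threads.

```python
from concurrent.futures import ThreadPoolExecutor
import socket


def resolve_one(name, resolver=socket.gethostbyname):
    """Return (name, address), or (name, None) if the lookup fails."""
    try:
        return name, resolver(name)
    except OSError:  # socket.gaierror and socket.herror are subclasses
        return name, None


def resolve_serial(names, resolver=socket.gethostbyname):
    """Look up each name in turn; total time is the sum of all lookups."""
    return dict(resolve_one(name, resolver) for name in names)


def resolve_parallel(names, resolver=socket.gethostbyname, workers=32):
    """Look up names concurrently; slow or dead hosts no longer block the rest."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map yields results in input order.
        return dict(pool.map(lambda name: resolve_one(name, resolver), names))
```

Both functions return the same mapping, so a command line switch like the one mentioned above could simply pick which of the two to call. Hosts that fail to resolve map to `None` and can be skipped by the indexer.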
Got bored tonight and finished writing incremental updates, updated the README, cleaned up a bunch of stuff and decided it should be released (especially since the last version had some problems in it that probably would have prevented it from being really usable). So if anyone's watching this, enjoy :)
* Added a configuration file that the scripts read, smbsearch.conf
* Added a script to create the database tables
* Added incremental updates in the form of a new script, gen-incremental.sh
* Fixed a few minor bugs
Check it out in the download area. Next planned feature is incremental index updates, and then true database-agnostic behavior.
Copyright © 2009 Geeknet, Inc. All rights reserved. Terms of Use