Name Last modified Size Description
Parent Directory - CHANGES 2004-05-18 00:00 6.8K CONTRIBUTIONS 2004-05-18 00:00 4.2K COPYRIGHT 2004-05-18 00:00 1.1K Makefile 2004-05-18 00:00 5.2K Makefile.NeXT 2004-05-18 00:00 8.2K Makefile.alpha 2004-05-18 00:00 8.3K Makefile.hp 2004-05-18 00:00 8.2K Makefile.in 2004-05-18 00:00 5.1K Makefile.linux 2004-05-18 00:00 8.3K Makefile.rs6000 2004-05-18 00:00 8.3K Makefile.sgi 2004-05-18 00:00 8.3K Makefile.solaris 2004-05-18 00:00 8.3K Makefile.sunos 2004-05-18 00:00 8.3K README 2004-05-18 00:00 5.9K README.install 2004-05-18 00:00 5.8K agrep/ 2013-07-24 19:06 - bin/ 2013-07-24 19:06 - communicate.c 2004-05-18 00:00 9.3K compress/ 2013-07-24 19:06 - config.cache 2004-05-18 00:00 1.1K config.status 2004-05-18 00:00 3.6K configure 2004-05-18 00:00 31K configure.in 2004-05-18 00:00 733 defs.h 2004-05-18 00:00 1.3K get_filename.c 2004-05-18 00:00 12K get_index.c 2004-05-18 00:00 46K glimpse.1 2004-05-18 00:00 33K glimpse.chronicle 2004-05-18 00:00 2.2K glimpseindex.1 2004-05-18 00:00 23K glimpseserver.1 2004-05-18 00:00 2.9K index/ 2013-07-24 19:07 - install-sh 2004-05-18 00:00 4.7K lib/ 2013-07-24 19:07 - libtemplate/ 2013-07-24 19:08 - main.c 2004-05-18 00:00 96K mkinstalldirs 2004-05-18 00:00 651 split.c 2004-05-18 00:00 19K
GLIMPSE 3.0: searching entire file systems Glimpse version 3.0 improves the original version in many ways. The main changes are listed below. Glimpse is a very powerful indexing and query system that allows you to search through all your files very quickly. It can be used by individuals for their personal file systems as well as by organizations for large data collections. Glimpse is the default search engine in Harvest (see below). Glimpseindex, which you run by saying "glimpseindex DIR" builds an index of all text files in the tree rooted at DIR. (e.g., glimpseindex ~ indexes all your files.) With it, glimpse can search through all files much the same way as agrep (or any other grep), except that you don't have to specify file names and the search is fast. For example, glimpse -1 unbelievable will find all occurrences (in all your files!) of "unbelievable" allowing one spelling error; glimpse -F mail arizona will find all occurrences of "arizona" in all files with "mail" somewhere in their name; glimpse 'Arizona desert;windsurfing' will find all lines that contain both "Arizona desert" and "windsurfing". Glimpse supports three types of indexes: a tiny one (2-3% of the size of all files), a small one (7-9%), and a medium one (20-30%). The larger the index the faster the search. Glimpse supports most of agrep's options (agrep is our powerful version of grep, and it is part of glimpse) including approximate matching (e.g., finding misspelled words), Boolean queries, and even some limited forms of regular expressions. The WWW home page for glimpse is in http://glimpse.cs.arizona.edu:1994/ HTML version of glimpse manual pages is in http://glimpse.cs.arizona.edu:1994/glimpsehelp.html HTML version of glimpseindex manual pages is in http://glimpse.cs.arizona.edu:1994/glimpseindexhelp.html HTML version of glimpseserver manual pages is in http://glimpse.cs.arizona.edu:1994/glimpseserverhelp.html The complete source code for glimpse and glimpseindex, as well as manual pages and other stuff can be obtained from ftp://cs.arizona.edu/glimpse/glimpse-3.0.src.tar.Z Sparc SunOS 4.1.1 executables are in ftp://cs.arizona.edu/glimpse/glimpse-3.0.bin.sunos.4.1.1.tar.Z Sparc SunOS 4.1.3 executables are in ftp://cs.arizona.edu/glimpse/glimpse-3.0.bin.sunos.4.1.3.tar.Z Solaris executables are in ftp://cs.arizona.edu/glimpse/glimpse-3.0.bin.solaris.tar.Z Dec Alpha OSF/1 executables are in ftp://cs.arizona.edu/glimpse/glimpse-3.0.bin.alpha.tar.Z There are also binaries for AIX, SGI, Linux, and HP. (We thank Heinrich Stamerjohanns, Chris Dalton, [email protected], Jim Hurley, and Piroz Mohseni for those.) More binaries will be posted as they become available. An article describing the ideas behind the design of glimpse is in ftp://cs.arizona.edu/glimpse/glimpse.ps.Z GlimpseHTTP home page is in http://glimpse.cs.arizona.edu:1994/ghttp/ (GlimpseHTTP is a Glimpse add-on tool to provide full-text search for your WWW server) Harvest's WWW home page is http://harvest.cs.colorado.edu/ (Harvest is an integrated set of tools to gather, extract, organize, search, cache, and replicate relevant information across the Internet.) Mail [email protected] to be added to the glimpse mailing list. Mail [email protected] to report bugs, ask questions, discuss tricks for using glimpse, etc. (This is a moderated mailing list.) Udi Manber, Burra Gopal, and Sun Wu. Main changes and additions in version 3.0: 2.1 ---> 3.0 - added a data structure (in .glimpse_turbo) that speeds up queries using -w and -i considerably for large indexes. It is meant mostly for servers using glimpse (e.g., Harvest and glimpseHTTP servers), but it benefits everyone. With this "turbo" option, typical queries take less than a second even for very large indexes. This was so successful that we made it the default rather than an option (it used to be -T in some earlier versions). If the .glimpse_turbo file is deleted, glimpse will still work properly (but glimpseindex -f and -a require it). - incremental indexing is now fully supported (even for -b). Deletion from the index is supported. glimpseindex -d filename(s) completely deletes the files from the index; glimpseindex -D filename(s) deletes the files only from the file list. - the index has been improved in several ways (transparently except for speed and space). As a result, indices built with earlier versions of glimpseindex will not work with 3.0 -- you must reindex again. - several options were added to glimpseindex and glimpse: glimpseindex -E indexes all file without attempting to run the filetype filtering (but excluded files or suffixes still apply). glimpse -Q extends -N in a nice way giving much more information about the matches in the index. glimpse -L has more options: -L x | x:y | x:y:z if one number is given, it is a limit on the total number of matches. Glimpse outputs only the first x matches. If two numbers are given (x:y), then y is an added limit on the total number of files. If three numbers are given (x:y:z), then z is an added limit on the number of matches per file. If any of the x, y, or z is set to 0, it means to ignore it (in other words 0 = infinity in this case); for example, -L 0:10 will output all matches to the first 10 files that contain a match. (There are also some undocumented-as-yet options. We are running out of letters. Only -j and -Y are not used!) - glimpse 3.0 still has a LOT of makefiles (one per architecture / OS). We hope to include autoconf support for glimpse in the future: but these should be sufficient for most purposes. - several bugs were fixed, and the whole package is now more portable. Binaries and make files for the following platforms are now available: AIX-3.2.5, HPPA, HPMC68K, IBM-RS6000, Linux, SGI. (Binaries and make files for SUNOS4.1.1, SUNOS4.1.3, SOLARIS 5.3 and DEC OSF/1 (ALPHA) are avaialable as usual.) See README.install for more details.