Data Mining is compatible with Leopard, Snow Leopard, Lion and Mountain Lion.
Data Mining was developed to find the number of hits (string occurrences) within a large text. To use Data Mining, use the File>Open menu to load a plain text file or paste the plain text to be searched into the window, enter the strings (case-sensitive) into the fields (28 fields provided) and click on the "Start Mining" button. Hits will be shown to the left of each string. The contents of the 28 search fields are saved when you quit and reloaded when you relaunch. To use the Ad Hoc search capability, type the search string (case-sensitive) into the Ad Hoc String field and click on the "Ad Hoc" button. The number of hits will appear above the "Ad Hoc" button. The Ad Hoc String is not saved between sessions. There is also a Find capability. Type the search string (case-insensitive) into the Find field and click on the "Find" button. The window will scroll to that row (just above the red line in the middle of the window) and the search string will be highlighted. You can click on the "Find Next" button to step through all string occurrences. The total number of text characters is displayed. Click on the "Reset" button to clear the text window, Hits and the Ad Hoc and Find search strings. Click on the "Clear Hits" button to clear just the Hits. Data Mining is optimized for speed. A search of one million characters for 28 strings takes approximately one second. A Help window is available. This utility is freeware.
Mac Version 2.3.0 is compatible with Mountain Lion's default security, is Intel, and uses a new icon. Windows Version 2.1.3 uses an installer and is compatible with Win 7.
If you paste text into the window (instead of loading a text file), click on the window and click on the "Return" key five times and then paste the text. This will set the red line offset.
Displayed above, raw data of web site activity for a short period of time is mined for 14 strings. Plain text has been pasted into the text window and unique strings have been entered into the 14 fields. In this case, the text window has approximately 1.6 M characters. In the above case, there were 327 hits for the "mathpractice" string. The Ad Hoc string search for "games" matches the number of Hits for "games in the fourth string. The window is scrolled to row 2317 where the 23rd string of "mathpractice" is found using the Find capability. Note the Find String in the row above the red line in the window is highlighted. This string started at character 62899. On the Mac platform, clicking the arrows (or moving the handle) on the scrollbar results in jumps of more than one row. The jump size depends on number of rows. The "Fine Scroll Up" and "Find Scroll Down" buttons scroll the window one row at a time; the scroll row number is displayed when these buttons or the "Find" and "Find Next" buttons are used.
Above is an example of using Data Mining with non-English text having accented characters.
The Windows display is different from that shown above; the "Fine Scroll Up" and "Fine Scroll Down" buttons are not needed as the scrollbar moves one row at a time.
Click here to download Intel Version (1.8 MB) for Macintosh (OS 10.5 or later).
Click here to download the version (1.1 MB) for Win 7.