Welcome

Data Sets
Overview
Creating New
--ASCII Text
--XML
--Google Harvest
--Web Harvest
Settings
--Fields
--Stopwords
--Stopmajors
--Punctuation Rules
Editing
Merging
Exporting
Importing
Subsetting

Visualizations
Galaxy
--Basics
--Outliers
ThemeView
Settings

Tools
Document Viewer
Gist
Groups
--Basics
--Evidence Panel
Major Terms
Queries
Print
Probe
Time Slicer

About version 2.2
Overview
Known issues

Gist

gist window

Accessing the Gist Tool

gist tool button Click the Gist button on the toolbar, or
  From the Windows menu, choose Gist

To “gist” a set of documents

When it is launched, Gist will “get the gist” of all documents that are selected and display a list of words and how many documents they are found in, ordered by frequency. To gist a different set of documents:

  1. Select the documents. Ways to select documents.
  2. On the Gist window, click the Gist Selected Documents button.

To see the gist of only the fields used for computing topical content and document cluster assignments,

Click the Computation Fields radio button.

To sort results alphabetically or numerically

  • To sort items in ascending order, click on a column header.
    [Indicated by 1+ following the column title.]
  • To sort in descending order, click on a column sorted in ascending order. [Indicated by 1- following the column title.]
  • To stop sorting, ALT-click on any column header.

For a secondary sort

Suppose you want to sort items by date and then within a given date, to sort them alphabetically. The date sort is the primary sort; the alphabetical sort is the secondary sort. In general, specify the secondary sort first, like this:

  1. Click or SHIFT-click on column for the secondary sort.
    [1+ or 1- will appear following the column title.]
  2. Then sort the column for the primary sort. [1+ or 1- will appear following the primary sort column title, and 2+ or 2- will appear following the secondary sort column title.]

To construct a query using words in Gist’s word list

  1. Select the word(s) you want to query on from Gist's word list. To select a single work, click on it. Use SHIFT-click to select a group of contiguous words, or CTRL-click to select non-contiguous words.
  2. Click Add Words. The words are copied to the text area below the word listbox.
  3. Highlight the words in the text area and press CTRL-C to copy them to the clipboard.
  4. Open the Queries Tool.
  5. Click in the Query Text box. Paste the words into the Query Text box using CTRL-V.

To add additional words to those previously selected

  1. Click on the words to select them, as before.
  2. Click the Add Words… button. They will be added to those previously selected.
  3. To erase all terms in the text area, click Clear.

Finding words that should be stopwords and adding them to the Stopwords list

  1. Select all documents in the data set.
  2. Gist them.
  3. Now, select terms that appear in all or almost all documents. Click on a word to select it. Use SHIFT-click to select a group of contiguous words, or CTRL-click to select non-contiguous words.
  4. Click Add Words. The words are copied to the text area below the word listbox.
  5. Highlight the words in the text area and press CTRL-C to copy them to the clipboard.
  6. Access the Stopwords panel for the data set, in the data set editor.
  7. Click Add... The Add Stopwords window opens.
  8. Click in the text box.
  9. Press CTRL-V to paste all of the Gist terms into the text box.
  10. Click Done. The words are added to the Current Stopwords list.
  11. Click Finish. Reprocessing of the data set with the new stopwords begins.

See Stopwords for information on how to modify the default stopwords file.

To see helpful information about a part of the Gist window
Hover the cursor over it. Information will appear in the Help panel at the bottom of the window.