Creating Dataset Subsets

Video tutorial available.


It is possible to create a subset dataset that contains some but not all of the documents in an existing dataset. In general, there are two ways to do this:

Types of Subsets

From Selection. Includes all currently selected documents.

From Focus. Includes all the documents in the current focus (specified by the Set Focus panel in the right sidebar).

From Outside Focus: Includes all documents that are not included in the current focus.

How to Create a Subset

To create a subset by selecting documents for inclusion

  1. Select the documents you want to include in the subset.
  2. Create a new group from the selected documents using the Groups panel in the right sidebar. As you discover documents to add, you can add them to the group, or after selecting documents, you can create a subset.
  3. From the IN-SPIRE main menu, choose File > Subset from Selection. Go to Step 4 (below).

To create a subset from the current Focus

  1. Select the documents you want to focus on and click the FocusFocus button on the Selection panelbutton in the Selection panel in the right sidebar –OR–
    Drag groups to the upper box in the Set Focus panel and click that panel's Focus Focus button in the Set Focus panelbutton.
  2. From the IN-SPIRE main menu, choose File > Subset from Focus. Go to Step 4 (below).

To create a subset from Outside Focus

  1. Select documents you want to focus on and click the FocusFocus button on the Selection panelbutton in the Selection panel in the right sidebar
    –OR–
    Drag groups to the upper box in the Set Focus panel and click that panel's Focus Focus button in the Set Focus panelbutton.
  2. From the IN-SPIRE main menu, choose File > Subset from Outside Focus. Go to Step 4 (below).

All types

  1. Once the Dataset Wizard window opens, edit the name (the default is "<dataset name> Subset") and any of the settings (stopwords, punctuation rules, and stopmajor list). For further information, see Stopwords, Punctuation Rules or Stopmajor List. You should not need to edit the Fields.
  2. Click Finish. The subset dataset is processed and appears in the list of datasets in the Projects window.