Creating Dataset Subsets
Video tutorial available.
It is possible to create a subset dataset that contains some but not all of the documents in an existing dataset. In general, there are two ways to do
this:
Types of Subsets
From Selection. Includes all currently selected documents.
From Focus. Includes all the documents in the current focus (specified by the Set Focus panel in the right sidebar).
From Outside Focus: Includes all documents that are not included in the current focus.
How to Create a Subset
To create a subset by selecting documents for inclusion
- Select the documents you want to include in the subset.
- Create a new group from the selected documents using the Groups panel in the right sidebar. As you discover documents to add, you can add them to the group, or after selecting documents, you can create a subset.
- From the IN-SPIRE main menu, choose File > Subset from Selection. Go to Step 4 (below).
To create a subset from the current Focus
- Select the documents you want to focus on and click the Focusbutton in the Selection panel in the right sidebar –OR–
Drag groups to the upper box in the Set Focus panel and click that panel's Focus button.
- From the IN-SPIRE main menu, choose File > Subset from Focus. Go to Step 4 (below).
To create a subset from Outside Focus
- Select
documents you want to focus on and click the Focusbutton in the Selection panel in the right sidebar
–OR–
Drag groups to the upper box in the Set Focus panel and click that panel's Focus button.
- From the IN-SPIRE main menu, choose File > Subset from Outside Focus. Go to Step 4 (below).
All types
- Once the Dataset Wizard window opens, edit the name
(the default is "<dataset name> Subset") and any of the
settings (stopwords, punctuation rules, and stopmajor list). For further
information, see Stopwords, Punctuation Rules or Stopmajor List. You should not need to edit the Fields.
- Click Finish.
The subset dataset is processed and appears in the list of datasets in the Projects window.