Merging Data Sets

Accessing the Data Set Editor

If IN-SPIRE is already running, choose File > Data Sets... from the main IN-SPIRE menu bar. Otherwise, the Data Set Editor window opens automatically at startup, after the splash screen has appeared.

Basic Steps

  1. In the Data Set Editor window, click one of the data sets you want to merge in the list of data sets.
  2. Click Merge. The Compatible Data Sets window opens, listing all data sets that may be merged with the one you selected. To find out which data sets are compatible, see "What data sets are compatible?" below.
  3. If the data set that you want to merge with is in the list, click it to select it.
  4. Click Merge. The two data sets are merged.

What data sets are compatible?

The requirements for compatible data sets are quite stringent in this release of IN-SPIRE. Compatible data sets have:

  • The same source data type. The type may be ASCII Text, XML, FBIS Portal Harvest, Google Harvest, or Web Harvest.
  • The same Stopwords (both data sets use the same Stopwords file).
  • The same format and the same fields defined.

    This means that if you used the same source data files for two different data sets but you processed them differently by defining different fields, the two data sets will not be compatible.
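The three requirements above can be sketched as a simple check. This is only an illustration of the rules as stated here, not IN-SPIRE's own code: the DataSetInfo record, its attribute names, and both functions are hypothetical, and comparing the field names as an ordered tuple is an assumption, since this page does not say whether field order matters.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DataSetInfo:
    """Hypothetical summary of a data set; not an IN-SPIRE type."""
    name: str
    source_type: str     # e.g. "ASCII Text", "XML", "Web Harvest"
    stopwords_file: str  # identifier of the Stopwords file in use
    format_name: str     # label for the format definition
    fields: tuple        # names of the defined fields


def incompatibilities(a, b):
    """Return the reasons two data sets cannot be merged (empty if none)."""
    reasons = []
    if a.source_type != b.source_type:
        reasons.append("source data type differs")
    if a.stopwords_file != b.stopwords_file:
        reasons.append("Stopwords file differs")
    # Assumption: fields are compared by name and order.
    if a.format_name != b.format_name or a.fields != b.fields:
        reasons.append("format or fields differ")
    return reasons


def are_compatible(a, b):
    """True when none of the three requirements is violated."""
    return not incompatibilities(a, b)
```

Returning the list of reasons, rather than only True or False, makes it easier to see why a data set does not appear in the Compatible Data Sets window.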

Two compatible data sets may have different Stopmajor files. If they do, the Stopmajor file for the merged data set contains all of the words from the first data set's Stopmajor file plus all of the words from the second data set's Stopmajor file.
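As a rough sketch, combining two Stopmajor lists amounts to taking their union. The function name merge_stopmajors is hypothetical, and dropping duplicate words while keeping first-seen order is an assumption; this page states only that the merged file holds the words from both files.

```python
def merge_stopmajors(first, second):
    """Combine two Stopmajor word lists into one.

    Assumption: a word found in both lists is kept once, in the
    position where it first appears.
    """
    merged = []
    seen = set()
    for word in list(first) + list(second):
        if word not in seen:
            seen.add(word)
            merged.append(word)
    return merged
```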

In a merge, what happens to saved Groups and Queries?

All Groups and Queries are preserved. If there are name collisions, the source data set name is prepended to the Group name. For example, suppose there is a Group called "Support" in both data set "DS1" and data set "DS2". If you merge these two data sets, the result will have a Group folder called "Support" which will contain Groups called "DS1:Support" and "DS2:Support".
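The naming rule in the example above can be sketched as follows. This is an illustration only: merge_groups is a hypothetical function, and representing each data set's Groups as a dictionary from Group name to contents, with a nested dictionary standing in for a Group folder, is an assumption made for the sketch.

```python
def merge_groups(name_a, groups_a, name_b, groups_b):
    """Merge two {group name: contents} mappings.

    A name found in only one data set is kept as it is. A name found
    in both becomes a folder (a nested dict) holding one entry per
    source, each prefixed with its data set name.
    """
    merged = {}
    all_names = list(groups_a) + [g for g in groups_b if g not in groups_a]
    for group_name in all_names:
        in_a = group_name in groups_a
        in_b = group_name in groups_b
        if in_a and in_b:
            merged[group_name] = {
                f"{name_a}:{group_name}": groups_a[group_name],
                f"{name_b}:{group_name}": groups_b[group_name],
            }
        elif in_a:
            merged[group_name] = groups_a[group_name]
        else:
            merged[group_name] = groups_b[group_name]
    return merged
```

With "Support" present in both "DS1" and "DS2", the merged result holds a "Support" folder containing "DS1:Support" and "DS2:Support", while Groups that exist in only one data set keep their original names.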