Chapter 5
Click Save Node to save the node’s details in a file. You can load node details only into
another node of the same type.
Click Store Node to store the selected node in a connected IBM SPSS Collaboration and
Deployment Services Repository.
Click Cache to expand the menu, with options for caching the selected node.
Click Data Mapping to expand the menu, with options for mapping data to a new source or
specifying mandatory fields.
Click Create SuperNode to expand the menu, with options for creating a SuperNode in the
current stream.
Click Generate User Input Node to replace the selected node. Examples generated by this node
will have the same fields as the current node.
Click Run From Here to run all terminal nodes downstream from the selected node.
Caching Options for Nodes
To optimize stream running, you can set up a cache on any nonterminal node. When you set up a
cache on a node, the cache is filled with the data that passes through the node the next time you
run the data stream. From then on, the data is read from the cache (which is stored on disk in a
temporary directory) rather than from the data source.
Caching is most useful following a time-consuming operation such as a sort, merge, or
aggregation. For example, suppose that you have a source node set to read sales data from a
database and an Aggregate node that summarizes sales by location. You can set up a cache on the
Aggregate node rather than on the source node because you want the cache to store the aggregated
data rather than the entire data set.
Note: Caching at source nodes, which simply stores a copy of the original data as it is read into
IBM® SPSS® Modeler, will not improve performance in most circumstances.
Nodes with caching enabled are displayed with a small document icon at the top right corner.
When the data is cached at the node, the document icon is green.