...

The intermediate data is stored in the following folders:

Repository Folder

When the process flow is executed, data from the source is converted to an intermediate form and then dispatched to the target. The intermediate data is stored in the repository folder. This must be a shared folder on the network that all the nodes of the cluster can access. Connecting to this folder must not require a username or password.

Recovery Folder

During execution of a process flow, its current state is stored in a recovery file, which is kept in the recovery folder. Whenever a process flow aborts due to a Kernel shutdown, the Recovery feature handles it automatically with the help of the recovery files. These files remain in the recovery folder until the process flow execution is completed. This folder should be shared among all the nodes of the cluster.

Rerun Folder

Process flows may be aborted for various reasons (for example, incorrect data mapping or schema definition). During execution, at every checkpoint, the process flow stores its current state in a rerun file. With the help of these files, you can rerun the process flow. These files are stored in the rerun folder, which should be shared among all the nodes of the cluster.
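
The sharing requirements above can be spot-checked from each node before a cluster is put into use. The following is a minimal sketch, not part of the product: the function names and the folder paths are hypothetical placeholders, and it only verifies that each folder is reachable as a directory and that a file can be created in it without supplying credentials.

```python
import os
import tempfile


def check_shared_folder(path):
    """Return a list of problems found for one folder (empty list means OK)."""
    if not os.path.isdir(path):
        return [f"{path}: not reachable or not a directory"]
    try:
        # Create and automatically delete a small probe file to confirm write access.
        with tempfile.NamedTemporaryFile(dir=path, prefix="node_check_"):
            pass
    except OSError as exc:
        return [f"{path}: cannot create files ({exc})"]
    return []


def check_cluster_folders(folders):
    """Check every named folder and return all problems, prefixed by folder name."""
    problems = []
    for name, path in folders.items():
        problems.extend(f"{name} folder: {p}" for p in check_shared_folder(path))
    return problems


if __name__ == "__main__":
    # Hypothetical paths: replace with the shared locations configured for your cluster.
    FOLDERS = {
        "repository": "/mnt/cluster_share/repository",
        "recovery": "/mnt/cluster_share/recovery",
        "rerun": "/mnt/cluster_share/rerun",
    }
    issues = check_cluster_folders(FOLDERS)
    for issue in issues:
        print(issue)
    if not issues:
        print("All folders are reachable and writable from this node.")
```

Running the script on every node, with the same paths, gives a quick indication of whether a folder is missing or read-only on a particular node. It does not confirm that all nodes see the same underlying share.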

Others

When applicable, the following additional folders must also be shared among all the nodes of the cluster.

...