Analytics depend on real (or realistic) data sets to demonstrate for users the true power of the application. Although reasonable data sets exist, sometimes you just need a custom data set for a particular industry or line of business problem. You might also need a large data set, one with a specific granularity (day / hour / etc), or want to maintain some referential integrity between different dimension or measure values. Of course, you could do this in Excel, but it is challenging to manage relationships between dimensions, enforce constraints in measure values, and create large volumes of sample data.
This repository contains a NodeJS powered "engine" that works with Excel templates containing the dimensions, measures, and constraints. It produces fact data, representing the cartesian product of all dimensions, with values at each intersection.
- Install NodeJS
- Your own SAP Analytics Cloud tenant or a trial account
- Install Git or clone the code from your browser
Run the following commands to clone the repository and execute the included sample:
git clone https://github.com/SAP-samples/analytics-cloud-data-generator.git
cd analytics-cloud-data-generator/code
npm install
node d2v_datagen.js
Execute the code using the samples provided
Enter file name for dimension list: CENSUS_STAFFING
Or
Enter file name for dimension list: SERVICE_TIMES
- Samples and documentation for creating fact data can be found in the input directory
- Sample outputs can be found in the output directory
- Example dimensions for model creation in SAP Analytics Cloud are located in the model folder
No known issues at this time.
Create an issue in this repository if you find a bug or have questions about the content.
If you would like to contribute, please submit a pull request in the usual fashion
Copyright (c) 2021 SAP SE or an SAP affiliate company. All rights reserved. This project is licensed under the Apache Software License, version 2.0 except as noted otherwise in the LICENSE file.