|
*Myriad* is a development toolkit for scalable data generation. Generating large synthetic datasets with a certain schema and a set of statistical properties is a challenging yet increasingly important task, especially in the context of benchmarking and testing systems designed for management of web-scale data like [Hadoop](hadoop.apache.org) or parallel RDBMS like [DB2](www-01.ibm.com/software/data/db2/). The *Myriad Toolkit* aims to ease this process by offering a fast and easy way to develop data generators that can generate dependent data on independently running nodes.
|
|
*Myriad* is a development toolkit for scalable data generators. Generating large synthetic datasets with a certain schema and a set of statistical properties is a challenging yet increasingly important task, especially in the context of benchmarking and testing systems designed for management of web-scale data like [Hadoop](hadoop.apache.org) or parallel RDBMS like [DB2](www-01.ibm.com/software/data/db2/). The *Myriad Toolkit* aims to ease this process by offering a fast and easy way to develop data generators that can generate dependent data on independently running nodes.
|