|
<span class="float-left"><img src="/TU-Berlin-DIMA/myriad-toolkit/wiki/img/myriad_logo.floatleft.png" alt="Myriad Toolkit logo" /></span> *Myriad* is a toolkit for developing scalable data generators. Generating large synthetic datasets that conform to a given schema and a set of statistical constraints is a challenging yet increasingly important task, especially for benchmarking and testing web-scale data management systems or parallel RDBMSs (e.g. [Hadoop](hadoop.apache.org), [DB2](www-01.ibm.com/software/data/db2/)). The *Myriad Toolkit* aims to simplify this process by providing a fast and easy way to develop data generators that produce *statistically dependent data* in parallel across a set of *independently running nodes*.
|