How to Generate Large Numbers of Random CSV Files in Java
Published: 2019-03-22


CSV remains one of the simplest and most widely used formats for data exchange. Because it is so common, developers often need to generate large volumes of CSV files for testing. My latest open source project aims to address this need.

This project, SDGen, is a Java library designed to help developers generate randomized data files. It currently supports the CSV and Fixed Width formats, with JSON support planned for the future.

This guide walks you through generating a simple CSV file using SDGen, with the Java Faker library supplying the random values.

Maven Integration

For Maven projects, you can add the required dependencies to your project's pom.xml file by including the following XML snippet:

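<dependency>
    <groupId>au.com.anthonybruno</groupId>
    <artifactId>SdGen</artifactId>
    <version>0.3.0</version>
</dependency>
<dependency>
    <groupId>com.github.javafaker</groupId>
    <artifactId>javafaker</artifactId>
    <version>0.14</version>
</dependency>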

Instructions

First, obtain an instance of the Faker class with the following code:

Faker faker = Faker.instance();

Next, use the Faker instance to generate values such as URLs or planet names, using methods like faker.internet().url() or faker.space().planet().

To create the structure for your CSV file, use the SDGen builder. Start with:

Gen.start()

Then, add fields using the addField method. This method takes two parameters: the field name (used to identify the column in the generated file) and a generator (a simple interface with a single method to generate random values). Here's how to add "First Name" and "Last Name" fields:

Gen.start()
    .addField("First Name", () -> faker.name().firstName())
    .addField("Last Name", () -> faker.name().lastName())
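
To make the "single-method interface" idea concrete, here is a minimal stand-alone sketch. The names `GeneratorSketch`, `ValueGenerator` and `column` are hypothetical and invented for illustration; they are not SDGen's actual types. The sketch only shows why a zero-argument lambda such as `() -> faker.name().firstName()` can be passed wherever an interface of this shape is expected.

```java
import java.util.ArrayList;
import java.util.List;

class GeneratorSketch {

    // Hypothetical stand-in for a single-method generator interface (not SDGen's real type).
    @FunctionalInterface
    interface ValueGenerator<T> {
        T generate();
    }

    // Calls the generator once per row and collects the values of one column.
    static <T> List<T> column(ValueGenerator<T> generator, int rows) {
        List<T> values = new ArrayList<>();
        for (int i = 0; i < rows; i++) {
            values.add(generator.generate());
        }
        return values;
    }

    public static void main(String[] args) {
        // A lambda with no arguments satisfies the interface.
        List<String> constant = column(() -> "Earth", 3);
        System.out.println(constant); // prints [Earth, Earth, Earth]

        // State captured by the lambda can change the value on each call.
        int[] counter = {0};
        List<Integer> sequence = column(() -> ++counter[0], 3);
        System.out.println(sequence); // prints [1, 2, 3]
    }
}
```

Because the generator is invoked once per row, any source of values (Faker, a counter, a lookup table) can be plugged in the same way.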

For the "Age" field, you can use SDGen's built-in generator with specified min and max values:

Gen.start()
    .addField("First Name", () -> faker.name().firstName())
    .addField("Last Name", () -> faker.name().lastName())
    .addField("Age", new IntGenerator(18, 80))
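
For readers curious what a min/max integer generator might do internally, the following is a rough sketch using only the JDK. `BoundedIntSketch` and `between` are hypothetical names, and treating both bounds as inclusive is an assumption of this sketch; this article does not say whether SDGen's IntGenerator includes the upper bound.

```java
import java.util.concurrent.ThreadLocalRandom;

class BoundedIntSketch {

    // Returns a random int in [min, max]. The inclusive upper bound is an assumption of this sketch.
    static int between(int min, int max) {
        if (min > max) {
            throw new IllegalArgumentException("min must not exceed max");
        }
        // nextLong's upper bound is exclusive; widening to long avoids overflow when max is Integer.MAX_VALUE.
        return (int) ThreadLocalRandom.current().nextLong(min, (long) max + 1);
    }

    public static void main(String[] args) {
        // Print a few sample ages in the same range used in the article.
        for (int i = 0; i < 5; i++) {
            System.out.println(between(18, 80));
        }
    }
}
```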

Next, specify the number of rows to generate and select the output format. For CSV files, you can use:

Gen.start()
    .addField("First Name", () -> faker.name().firstName())
    .addField("Last Name", () -> faker.name().lastName())
    .addField("Age", new IntGenerator(18, 80))
    .generate(1000)
    .asCsv()

Finally, specify how to output the data. Use the toFile method to save the generated data to a file:

Gen.start()
    .addField("First Name", () -> faker.name().firstName())
    .addField("Last Name", () -> faker.name().lastName())
    .addField("Age", new IntGenerator(18, 80))
    .generate(1000)
    .asCsv()
    .toFile("people.csv");
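
Conceptually, the chain above amounts to "one header line, then one line per row, each cell produced by its field's generator". The sketch below reproduces that idea with the JDK alone so the moving parts are visible. It is not SDGen's implementation: `CsvWriterSketch` and `buildLines` are hypothetical names, and the sketch does no quoting, so it assumes values never contain commas, quotes or newlines.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Supplier;

class CsvWriterSketch {

    // Builds a header line plus `rows` data lines, calling each field's supplier once per row.
    // A LinkedHashMap keeps the columns in the order the fields were added.
    static List<String> buildLines(Map<String, Supplier<Object>> fields, int rows) {
        List<String> lines = new ArrayList<>();
        lines.add(String.join(",", fields.keySet()));
        for (int i = 0; i < rows; i++) {
            List<String> cells = new ArrayList<>();
            for (Supplier<Object> generator : fields.values()) {
                cells.add(String.valueOf(generator.get()));
            }
            lines.add(String.join(",", cells));
        }
        return lines;
    }

    public static void main(String[] args) throws IOException {
        Map<String, Supplier<Object>> fields = new LinkedHashMap<>();
        int[] nextId = {0};
        fields.put("Id", () -> ++nextId[0]);
        fields.put("Planet", () -> "Mars");

        // Write to a temporary file so the example does not touch the working directory.
        Path target = Files.createTempFile("people", ".csv");
        Files.write(target, buildLines(fields, 1000), StandardCharsets.UTF_8);
        System.out.println("Wrote " + Files.readAllLines(target).size() + " lines to " + target);
    }
}
```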

That's it! Running the code will generate a CSV file in your project's working directory. Here's an example of the generated data:

First Name,Last Name,Age
Corrine,Berge,78
Gerald,Carter,63
Enid,Padberg,66
Eleanora,Murray,79
Coy,Okuneva,76
Jovan,Reynolds,77
Lane,Haag,48
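
When generated files feed automated tests, a quick sanity check on the output can be useful. The sketch below is one possible check, written against the JDK only: `CsvSanityCheck` and `rowsLookValid` are hypothetical names, and the plain comma split assumes values contain no quoted or embedded commas. It verifies the header and that every age falls within the range requested earlier (18 to 80).

```java
import java.util.Arrays;
import java.util.List;

class CsvSanityCheck {

    // Returns true when the header matches and every data row has three cells
    // with an integer age between minAge and maxAge (both inclusive).
    static boolean rowsLookValid(List<String> lines, int minAge, int maxAge) {
        if (lines.isEmpty() || !lines.get(0).trim().equals("First Name,Last Name,Age")) {
            return false;
        }
        for (String line : lines.subList(1, lines.size())) {
            String[] cells = line.trim().split(",");
            if (cells.length != 3) {
                return false;
            }
            int age;
            try {
                age = Integer.parseInt(cells[2]);
            } catch (NumberFormatException e) {
                return false;
            }
            if (age < minAge || age > maxAge) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        // A few rows copied from the sample output above.
        List<String> sample = Arrays.asList(
                "First Name,Last Name,Age",
                "Corrine,Berge,78",
                "Gerald,Carter,63",
                "Enid,Padberg,66");
        System.out.println(rowsLookValid(sample, 18, 80)); // prints true
    }
}
```

To check a real file, the lines could be loaded with `Files.readAllLines(Paths.get("people.csv"))` and passed to the same method.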

For more details about SDGen, visit its official website.

