Maximum csv file size for loadData

Adam2 · March 11, 2015, 11:02am

Hi, I’m trying to use Liquibase to import data into a large database table. I’m running with this command:

liquibase --driver=oracle.jdbc.OracleDriver ^
--classpath="C:\tools\tomcat-ex\lib\oracle-jdbc-11.2.0.2.0.jar" ^
--changeLogFile=abc-schema-base.xml ^
--url="jdbc:oracle:thin:@abc.def:1521:hijklmnop" ^
--username=abcdef ^
--password=abcdef ^
--logLevel=DEBUG ^
update

The changeset uses loadData to import a CSV file. The file I’m importing is 650 MB. Whenever I run the update, I get OutOfMemoryErrors (after a couple of hours of waiting).

Both the changeset and csv were generated with generateChangeLog. So full marks to Liquibase for at least being able to extract the large table!

I’m running with -Xmx2048m, and Liquibase 3.3.2.

I’m considering breaking the CSV into multiple smaller files, each loaded by its own changeset, but I’m unsure how small the files would need to be. Does anyone have any info on the largest CSV file size Liquibase loadData can handle?
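
For reference, the splitting step could look roughly like the sketch below. It streams the source file row by row and repeats the header in every chunk, so each piece is a valid CSV on its own. The function name `split_csv` and the default of 100,000 rows per file are my own placeholders; nothing here establishes what chunk size loadData can actually handle, so that number would need to be found by trial.

```python
import csv
from pathlib import Path


def split_csv(source, out_dir, rows_per_file=100_000):
    """Split a CSV into numbered chunk files, repeating the header in each.

    rows_per_file is an arbitrary starting point, not a known safe limit.
    Returns the list of chunk paths in the order they were written.
    """
    source = Path(source)
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)

    chunk_paths = []
    out_file = None
    writer = None
    try:
        with source.open(newline="", encoding="utf-8") as src:
            reader = csv.reader(src)
            header = next(reader, None)
            if header is None:
                return chunk_paths  # empty input: nothing to split

            for index, row in enumerate(reader):
                # Start a new chunk every rows_per_file data rows.
                if index % rows_per_file == 0:
                    if out_file is not None:
                        out_file.close()
                    number = len(chunk_paths) + 1
                    path = out_dir / f"{source.stem}-{number:04d}.csv"
                    out_file = path.open("w", newline="", encoding="utf-8")
                    writer = csv.writer(out_file)
                    writer.writerow(header)
                    chunk_paths.append(path)
                writer.writerow(row)
    finally:
        if out_file is not None:
            out_file.close()
    return chunk_paths
```

Only one row is held in memory at a time, so the splitter itself should cope with the 650 MB file; whether the resulting chunks are small enough for loadData is a separate question.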

Has anyone else encountered and resolved this kind of problem?

Alternatively I can exclude this table from my Liquibase scripts as it’s the only one causing issues.

Thanks in advance,
Adam.

un1426044841463r14id · March 11, 2015, 11:02am

Thanks for your response Steve,

The reason for doing this is that I’m migrating a legacy application to Liquibase, and it would be convenient if I could build the entire database from Liquibase, rather than using multiple tools. The main benefit Liquibase gives me is versioning with changesets, and running an import from another tool won’t easily fit into the changeset model.

Currently my legacy application has many unmanaged and unversioned database snapshots, which makes it very difficult to run repeatable CI cycles of: develop, build, test, release, deploy.

Like you say, there are more efficient ways to load large datasets, and I may need to take that approach instead.

un1382561729492r88id · March 11, 2015, 11:02am

Seems like something that should and could be fixed in Liquibase.

Just curious - why are you using Liquibase to load that much data? What is the use case?

I would recommend that you use native tools if possible when loading that much data into a database. Liquibase is really intended for managing the structure of a database rather than the contents. It is able to work with the data also, but it is intended mainly for loading small sets of data - tables full of constants for example, or small test data. Since Liquibase works at the JDBC level, what it ends up doing is generating tons of INSERT statements, which is an extremely inefficient way to load bulk data.
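
As a rough illustration of the alternative to row-at-a-time INSERTs, the sketch below streams a CSV and sends rows to the database in fixed-size batches, so memory use stays bounded regardless of file size. It is only a sketch under assumptions: it uses Python's built-in sqlite3 as a stand-in for Oracle, the function name `load_csv_in_batches` and the batch size of 1000 are made up for the example, and it does not reflect how Liquibase implements loadData internally. A native bulk loader would normally still be faster than this.

```python
import csv
import sqlite3
from itertools import islice


def load_csv_in_batches(conn, csv_path, table, batch_size=1000):
    """Stream a CSV into `table`, one executemany() call per batch.

    Assumes the CSV header names match the table's columns. The table and
    column names are interpolated into the SQL, so only use trusted input.
    Returns the number of rows inserted.
    """
    total = 0
    with open(csv_path, newline="", encoding="utf-8") as src:
        reader = csv.reader(src)
        header = next(reader)
        columns = ", ".join(header)
        placeholders = ", ".join("?" for _ in header)
        sql = f"INSERT INTO {table} ({columns}) VALUES ({placeholders})"

        while True:
            # Pull at most batch_size rows; the rest of the file stays on disk.
            batch = list(islice(reader, batch_size))
            if not batch:
                break
            conn.executemany(sql, batch)
            conn.commit()
            total += len(batch)
    return total
```

Committing after each batch keeps the open transaction small; whether that is acceptable depends on whether a partially loaded table can be tolerated if the load fails midway.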

Steve Donie
Principal Software Engineer
Datical, Inc. http://www.datical.com/
