I created a 2-column table with 2.7M rows and tried to CTAS from the primary table into a new table, and hit the following issue.
root@:26257/test> select count(1) from accounts;
+---------+
|  count  |
+---------+
| 2790683 |
+---------+
(1 row)
root@:26257/test> insert into accounts2 select * from accounts;
pq: command is too large: 116025971 bytes (max: 67108864)
root@:26257/test> drop table accounts2;
DROP TABLE
Time: 30.228115ms
root@:26257/test> create table accounts2 as select * from accounts;
pq: command is too large: 345539008 bytes (max: 67108864)
root@:26257/test> select * from crdb_internal.node_statement_statistics;
+---------+------------------+-------+--------------------------------------
The root cause of this issue is the limit on transaction size. INSERT INTO ... SELECT is naturally affected by this, because it has to preserve transactionality.
CREATE TABLE AS, on the other hand, does not have to be transactional. We should be able to support large CREATE TABLE AS statements by breaking them up into multiple transactions that use a consistent transaction timestamp, I think.
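The "consistent transaction timestamp" idea can be sketched at the SQL level. This is purely an illustration of the concept, not the internal implementation: the column names and batch bounds are hypothetical, and whether `AS OF SYSTEM TIME` is permitted inside a mutation varies by CockroachDB version.

```sql
-- Hypothetical sketch: copy in batches, each reading the source at the
-- same fixed timestamp, so the copy is a consistent snapshot of the
-- source even though it spans multiple transactions.
CREATE TABLE accounts2 (id INT PRIMARY KEY, balance INT);

-- Pick one timestamp up front and reuse it for every batch.
-- The id ranges below are illustrative; a real implementation would
-- paginate on the table's actual primary key.
INSERT INTO accounts2
  SELECT id, balance FROM accounts AS OF SYSTEM TIME '2019-07-01 00:00:00'
  WHERE id >= 0 AND id < 500000;

INSERT INTO accounts2
  SELECT id, balance FROM accounts AS OF SYSTEM TIME '2019-07-01 00:00:00'
  WHERE id >= 500000 AND id < 1000000;
-- ...and so on until the key space is covered.
```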
cc @tschottdorf @danhhz - perhaps there's a fancier way to do this with SSTable rewriting?
CREATE TABLE AS does need to be transactional - the resulting table should come into existence all at once from the perspective of any outside observer.
Note that the transaction size limit has been significantly raised in 2.0, but CREATE TABLE AS runs into the (much smaller) command size limit. A transactional CREATE TABLE AS that splits its work into multiple KV-level commands would be able to handle larger tables (although it would still be slower than using the export/import machinery).
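For very large tables today, the export/import machinery mentioned above can be used directly instead of a single CTAS. A hedged sketch (the `nodelocal` paths, table name, and schema are illustrative, and `IMPORT TABLE ... CSV DATA` syntax differs across versions):

```sql
-- Illustrative only: bulk-copy a large table via EXPORT/IMPORT rather
-- than one transactional CTAS. Storage paths are hypothetical.
EXPORT INTO CSV 'nodelocal://1/accounts-dump' FROM TABLE accounts;

IMPORT TABLE accounts2 (id INT PRIMARY KEY, balance INT)
  CSV DATA ('nodelocal://1/accounts-dump/export*.csv');
```

The tradeoff is that the new table does not appear atomically from the client's perspective the way a transactional CTAS would, but the bulk machinery sidesteps both the transaction and command size limits.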
Adjusted the title to make this easier to search for.
Please confirm that CTAS, CTAS with a 1=2 predicate, and INSERT ... SELECT all hit the same issue. I'm fairly sure of it, just want to confirm.
@awoods187 @jordanlewis @rolandcrosby
After discussion with @dt, we considered breaking the change into two stages:
The migration to a job would imply that the table created by CTAS would not be usable within the same txn, as jobs are only processed following a commit. Just wanted to run this by the Execution team before any dev work is started.
With #38374 now merged, I ran a benchmark on a 4 node default configuration cluster to compare CTAS performance in the 19.1.2 release, and on master.
The previous implementation would error out with pq: command is too large for source tables of more than roughly 300k rows, so I used a bank table with 300k rows (100 MiB) in my testing.
Average over 5 runs:
v19.1.2 - 7.88s
master - 1.02s
So we observe an ~87% speedup on smaller tables.
I then tested the new implementation with source tables of varying sizes, up to a 25 GiB, 50-million-row table, which it completed in 18m14s.
This is awesome @adityamaru27! I am super psyched about it.
Very nice work @adityamaru27.