Scylla: nodetool repair and nodetool removenode comands failed during repair process running

Created on 26 Jan 2021  路  23Comments  路  Source: scylladb/scylla

Installation details
Scylla version (or git commit hash): 4.4.rc0-0.20210125.f470c5d4d with build-id ca840e9f9c508f2b3fb3d4176d3f64127857285a
Cluster size: 6 nodes (i3.4xlarge)
OS (RHEL/CentOS/Ubuntu/AWS AMI): ami-0e045d50c5a6c9a4a (aws: eu-north-1)

Jenkins job URL
Test: longevity-cdc-4h-test
Test name: longevity_test.LongevityTest.test_custom_time
Test id: a92a84b6-e2e9-4d07-8aa6-a85bf96363f8
Test config file(s):

  • [longevity-cdc-100gb-4h.yaml] (https://github.com/scylladb/scylla-cluster-tests/blob/ca8a02961c8a045df4e2ea49d1cc2fde06d11620/test-cases/longevity/longevity-cdc-100gb-4h.yaml)

Issue description

====================================
The job run 4 c-s command with user profile which create 4 tables with different cdc properties. The exact profile settings could be found here:

  • https://github.com/scylladb/scylla-cluster-tests/blob/master/data_dir/cdc_profile.yaml
  • https://github.com/scylladb/scylla-cluster-tests/blob/master/data_dir/cdc_profile_preimage.yaml
  • https://github.com/scylladb/scylla-cluster-tests/blob/master/data_dir/cdc_profile_postimage.yaml
  • https://github.com/scylladb/scylla-cluster-tests/blob/master/data_dir/cdc_profile_preimage_postimage.yaml

After the Major compaction nemsis successfully running, nemesis TerminateAndRemove node running on node3. This nemesis stops the scylla, then run repair on each alive node, after nemesis run nodetool removenode from on of alive node.

Scylla was stopped successfully on node3:

2021-01-25T20:07:04+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-3 !INFO    | scylla: [shard 7] migration_manager - stopping migration service
2021-01-25T20:07:04+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-3 !INFO    | scylla: [shard 0] init - Shutting down migration manager was successful
2021-01-25T20:07:04+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-3 !INFO    | scylla: [shard 0] init - Shutting down database
2021-01-25T20:07:04+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-3 !INFO    | scylla: [shard 0] init - Shutting down database was successful
2021-01-25T20:07:04+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-3 !INFO    | scylla: [shard 0] init - Shutting down sst_dir_semaphore
2021-01-25T20:07:04+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-3 !INFO    | scylla: [shard 0] init - Shutting down sst_dir_semaphore was successful
2021-01-25T20:07:04+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-3 !INFO    | scylla: [shard 0] init - Shutting down messaging service
2021-01-25T20:07:04+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-3 !INFO    | scylla: [shard 0] init - Shutting down messaging service was successful
2021-01-25T20:07:04+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-3 !INFO    | scylla: [shard 0] init - Shutting down API server
2021-01-25T20:07:04+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-3 !INFO    | scylla: [shard 0] init - Shutting down API server was successful
2021-01-25T20:07:04+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-3 !INFO    | scylla: [shard 0] init - Shutting down migration manager notifier
2021-01-25T20:07:04+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-3 !INFO    | scylla: [shard 0] init - Shutting down migration manager notifier was successful
2021-01-25T20:07:04+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-3 !INFO    | scylla: [shard 0] init - Shutting down prometheus API server
2021-01-25T20:07:04+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-3 !INFO    | scylla: [shard 0] init - Shutting down prometheus API server was successful
2021-01-25T20:07:04+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-3 !INFO    | scylla: [shard 0] init - Shutting down sighup
2021-01-25T20:07:04+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-3 !INFO    | scylla: [shard 0] init - Shutting down sighup was successful
2021-01-25T20:07:04+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-3 !INFO    | scylla: [shard 0] init - Scylla version 4.4.rc0-0.20210125.f470c5d4d shutdown complete.

Then repair process was executed on each node. But the reapair command was failed on each node
node1

2021-01-25T20:08:13+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-1 !WARNING | scylla: [shard 5] repair - repair id [id=2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7] on shard 5 failed: std::runtime_error (repair id [id=2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7] on shard 5 failed to repair 109 sub ranges)
2021-01-25T20:08:13+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-1 !WARNING | scylla: [shard 0] repair - repair id [id=2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7] failed: std::runtime_error ({shard 0: std::runtime_error (repair id [id=2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7] on shard 0 failed to repair 109 sub ranges), shard 1: std::runtime_error (repair id [id=2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7] on shard 1 failed to repair 109 sub ranges), shard 2: std::runtime_error (repair id [id=2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7] on shard 2 failed to repair 109 sub ranges), shard 3: std::runtime_error (repair id [id=2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7] on shard 3 failed to repair 109 sub ranges), shard 4: std::runtime_error (repair id [id=2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7] on shard 4 failed to repair 109 sub ranges), shard 5: std::runtime_error (repair id [id=2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7] on shard 5 failed to repair 109 sub ranges), shard 6: std::runtime_error (repair id [id=2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7] on shard 6 failed to repair 109 sub ranges), shard 7: std::runtime_error (repair id [id=2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7] on shard 7 failed to repair 109 sub ranges), shard 8: std::runtime_error (repair id [id=2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7] on shard 8 failed to repair 109 sub ranges), shard 9: std::runtime_error (repair id [id=2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7] on shard 9 failed to repair 109 sub ranges), shard 10: std::runtime_error (repair id [id=2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7] on shard 10 failed to repair 109 sub ranges), shard 11: std::runtime_error (repair id [id=2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7] on shard 11 failed to repair 109 sub ranges), shard 12: std::runtime_error (repair id [id=2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7] on shard 12 failed to repair 109 sub ranges), shard 13: std::runtime_error (repair id [id=2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7] on shard 13 failed to repair 109 sub ranges)})
2021-01-25T20:08:13+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-1 !WARNING | scylla: [shard 0] repair - repair_tracker run for repair id [id=2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7] failed: std::runtime_error ({shard 0: std::runtime_error (repair id [id=2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7] on shard 0 failed to repair 109 sub ranges), shard 1: std::runtime_error (repair id [id=2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7] on shard 1 failed to repair 109 sub ranges), shard 2: std::runtime_error (repair id [id=2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7] on shard 2 failed to repair 109 sub ranges), shard 3: std::runtime_error (repair id [id=2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7] on shard 3 failed to repair 109 sub ranges), shard 4: std::runtime_error (repair id [id=2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7] on shard 4 failed to repair 109 sub ranges), shard 5: std::runtime_error (repair id [id=2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7] on shard 5 failed to repair 109 sub ranges), shard 6: std::runtime_error (repair id [id=2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7] on shard 6 failed to repair 109 sub ranges), shard 7: std::runtime_error (repair id [id=2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7] on shard 7 failed to repair 109 sub ranges), shard 8: std::runtime_error (repair id [id=2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7] on shard 8 failed to repair 109 sub ranges), shard 9: std::runtime_error (repair id [id=2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7] on shard 9 failed to repair 109 sub ranges), shard 10: std::runtime_error (repair id [id=2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7] on shard 10 failed to repair 109 sub ranges), shard 11: std::runtime_error (repair id [id=2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7] on shard 11 failed to repair 109 sub ranges), shard 12: std::runtime_error (repair id [id=2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7] on shard 12 failed to repair 109 sub ranges), shard 13: std::runtime_error (repair id [id=2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7] on shard 13 failed to repair 109 sub ranges)})

< t:2021-01-25 20:08:13,972 f:nemesis.py      l:2187 c:sdcm.nemesis         p:ERROR > sdcm.nemesis.ChaosMonkey: failed to execute repair command on node Node longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-1 [13.48.177.90 | 10.0.0.196] (seed: True) due to the following error: Encountered a bad command exit code!
< t:2021-01-25 20:08:13,972 f:nemesis.py      l:2187 c:sdcm.nemesis         p:ERROR > 
< t:2021-01-25 20:08:13,972 f:nemesis.py      l:2187 c:sdcm.nemesis         p:ERROR > Command: '/usr/bin/nodetool  repair '
< t:2021-01-25 20:08:13,972 f:nemesis.py      l:2187 c:sdcm.nemesis         p:ERROR > 
< t:2021-01-25 20:08:13,972 f:nemesis.py      l:2187 c:sdcm.nemesis         p:ERROR > Exit code: 2
< t:2021-01-25 20:08:13,972 f:nemesis.py      l:2187 c:sdcm.nemesis         p:ERROR > 
< t:2021-01-25 20:08:13,972 f:nemesis.py      l:2187 c:sdcm.nemesis         p:ERROR > Stdout:
< t:2021-01-25 20:08:13,972 f:nemesis.py      l:2187 c:sdcm.nemesis         p:ERROR > 
< t:2021-01-25 20:08:13,972 f:nemesis.py      l:2187 c:sdcm.nemesis         p:ERROR > Using /etc/scylla/scylla.yaml as the config file
< t:2021-01-25 20:08:13,972 f:nemesis.py      l:2187 c:sdcm.nemesis         p:ERROR > [2021-01-25 20:08:09,497] Starting repair command #2, repairing 1 ranges for keyspace system_traces (parallelism=SEQUENTIAL, full=true)
< t:2021-01-25 20:08:13,972 f:nemesis.py      l:2187 c:sdcm.nemesis         p:ERROR > [2021-01-25 20:08:13,604] Repair session 2 failed
< t:2021-01-25 20:08:13,972 f:nemesis.py      l:2187 c:sdcm.nemesis         p:ERROR > [2021-01-25 20:08:13,607] Repair session 2 finished
< t:2021-01-25 20:08:13,972 f:nemesis.py      l:2187 c:sdcm.nemesis         p:ERROR > 
< t:2021-01-25 20:08:13,972 f:nemesis.py      l:2187 c:sdcm.nemesis         p:ERROR > Stderr:
< t:2021-01-25 20:08:13,972 f:nemesis.py      l:2187 c:sdcm.nemesis         p:ERROR > 
< t:2021-01-25 20:08:13,972 f:nemesis.py      l:2187 c:sdcm.nemesis         p:ERROR > error: Repair job has failed with the error message: [2021-01-25 20:08:13,604] Repair session 2 failed
< t:2021-01-25 20:08:13,972 f:nemesis.py      l:2187 c:sdcm.nemesis         p:ERROR > -- StackTrace --
< t:2021-01-25 20:08:13,972 f:nemesis.py      l:2187 c:sdcm.nemesis         p:ERROR > java.lang.RuntimeException: Repair job has failed with the error message: [2021-01-25 20:08:13,604] Repair session 2 failed
< t:2021-01-25 20:08:13,972 f:nemesis.py      l:2187 c:sdcm.nemesis         p:ERROR >   at org.apache.cassandra.tools.RepairRunner.progress(RepairRunner.java:124)
< t:2021-01-25 20:08:13,972 f:nemesis.py      l:2187 c:sdcm.nemesis         p:ERROR >   at org.apache.cassandra.utils.progress.jmx.JMXNotificationProgressListener.handleNotification(JMXNotificationProgressListener.java:77)
< t:2021-01-25 20:08:13,972 f:nemesis.py      l:2187 c:sdcm.nemesis         p:ERROR >   at com.sun.jmx.remote.internal.ClientNotifForwarder$NotifFetcher.dispatchNotification(ClientNotifForwarder.java:583)
< t:2021-01-25 20:08:13,972 f:nemesis.py      l:2187 c:sdcm.nemesis         p:ERROR >   at com.sun.jmx.remote.internal.ClientNotifForwarder$NotifFetcher.doRun(ClientNotifForwarder.java:533)
< t:2021-01-25 20:08:13,972 f:nemesis.py      l:2187 c:sdcm.nemesis         p:ERROR >   at com.sun.jmx.remote.internal.ClientNotifForwarder$NotifFetcher.run(ClientNotifForwarder.java:452)
< t:2021-01-25 20:08:13,972 f:nemesis.py      l:2187 c:sdcm.nemesis         p:ERROR >   at com.sun.jmx.remote.internal.ClientNotifForwarder$LinearExecutor$1.run(ClientNotifForwarder.java:108)

node2

2021-01-25T20:08:26+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-2 !WARNING | scylla: [shard 5] repair - repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 5 failed: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 5 failed to repair 113 sub ranges)
2021-01-25T20:08:26+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-2 !WARNING | scylla: [shard 0] repair - repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] failed: std::runtime_error ({shard 0: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 0 failed to repair 113 sub ranges), shard 1: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 1 failed to repair 113 sub ranges), shard 2: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 2 failed to repair 113 sub ranges), shard 3: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 3 failed to repair 113 sub ranges), shard 4: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 4 failed to repair 113 sub ranges), shard 5: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 5 failed to repair 113 sub ranges), shard 6: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 6 failed to repair 113 sub ranges), shard 7: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 7 failed to repair 113 sub ranges), shard 8: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 8 failed to repair 113 sub ranges), shard 9: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 9 failed to repair 113 sub ranges), shard 10: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 10 failed to repair 113 sub ranges), shard 11: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 11 failed to repair 113 sub ranges), shard 12: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 12 failed to repair 113 sub ranges), shard 13: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 13 failed to repair 113 sub ranges)})
2021-01-25T20:08:26+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-2 !WARNING | scylla: [shard 0] repair - repair_tracker run for repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] failed: std::runtime_error ({shard 0: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 0 failed to repair 113 sub ranges), shard 1: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 1 failed to repair 113 sub ranges), shard 2: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 2 failed to repair 113 sub ranges), shard 3: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 3 failed to repair 113 sub ranges), shard 4: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 4 failed to repair 113 sub ranges), shard 5: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 5 failed to repair 113 sub ranges), shard 6: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 6 failed to repair 113 sub ranges), shard 7: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 7 failed to repair 113 sub ranges), shard 8: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 8 failed to repair 113 sub ranges), shard 9: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 9 failed to repair 113 sub ranges), shard 10: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 10 failed to repair 113 sub ranges), shard 11: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 11 failed to repair 113 sub ranges), shard 12: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 12 failed to repair 113 sub ranges), shard 13: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 13 failed to repair 113 sub ranges)})

< t:2021-01-25 20:08:27,504 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > Command: '/usr/bin/nodetool  repair '
< t:2021-01-25 20:08:27,504 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > 
< t:2021-01-25 20:08:27,504 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > Exit code: 2
< t:2021-01-25 20:08:27,504 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > 
< t:2021-01-25 20:08:27,504 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > Stdout:
< t:2021-01-25 20:08:27,504 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > 
< t:2021-01-25 20:08:27,504 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > Using /etc/scylla/scylla.yaml as the config file
< t:2021-01-25 20:08:27,504 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > [2021-01-25 20:08:23,032] Starting repair command #1, repairing 1 ranges for keyspace system_traces (parallelism=SEQUENTIAL, full=true)
< t:2021-01-25 20:08:27,504 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > [2021-01-25 20:08:27,137] Repair session 1 failed
< t:2021-01-25 20:08:27,504 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > [2021-01-25 20:08:27,139] Repair session 1 finished
< t:2021-01-25 20:08:27,504 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > 
< t:2021-01-25 20:08:27,504 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > Stderr:
< t:2021-01-25 20:08:27,504 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > 
< t:2021-01-25 20:08:27,504 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > error: Repair job has failed with the error message: [2021-01-25 20:08:27,137] Repair session 1 failed
< t:2021-01-25 20:08:27,504 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > -- StackTrace --
< t:2021-01-25 20:08:27,504 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > java.lang.RuntimeException: Repair job has failed with the error message: [2021-01-25 20:08:27,137] Repair session 1 failed
< t:2021-01-25 20:08:27,504 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR >   at org.apache.cassandra.tools.RepairRunner.progress(RepairRunner.java:124)
< t:2021-01-25 20:08:27,504 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR >   at org.apache.cassandra.utils.progress.jmx.JMXNotificationProgressListener.handleNotification(JMXNotificationProgressListener.java:77)
< t:2021-01-25 20:08:27,504 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR >   at com.sun.jmx.remote.internal.ClientNotifForwarder$NotifFetcher.dispatchNotification(ClientNotifForwarder.java:583)
< t:2021-01-25 20:08:27,504 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR >   at com.sun.jmx.remote.internal.ClientNotifForwarder$NotifFetcher.doRun(ClientNotifForwarder.java:533)
< t:2021-01-25 20:08:27,504 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR >   at com.sun.jmx.remote.internal.ClientNotifForwarder$NotifFetcher.run(ClientNotifForwarder.java:452)
< t:2021-01-25 20:08:27,504 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR >   at com.sun.jmx.remote.internal.ClientNotifForwarder$LinearExecutor$1.run(ClientNotifForwarder.java:108)

node4

2021-01-25T20:08:38+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-4 !WARNING | scylla: [shard 3] repair - repair id [id=1, uuid=f3dd6fcd-fe9d-47f6-85b5-cb9977de5f79] on shard 3 failed: std::runtime_error (repair id [id=1, uuid=f3dd6fcd-fe9d-47f6-85b5-cb9977de5f79] on shard 3 failed to repair 104 sub ranges)
2021-01-25T20:08:38+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-4 !WARNING | scylla: [shard 0] repair - repair id [id=1, uuid=f3dd6fcd-fe9d-47f6-85b5-cb9977de5f79] failed: std::runtime_error ({shard 0: std::runtime_error (repair id [id=1, uuid=f3dd6fcd-fe9d-47f6-85b5-cb9977de5f79] on shard 0 failed to repair 104 sub ranges), shard 1: std::runtime_error (repair id [id=1, uuid=f3dd6fcd-fe9d-47f6-85b5-cb9977de5f79] on shard 1 failed to repair 104 sub ranges), shard 2: std::runtime_error (repair id [id=1, uuid=f3dd6fcd-fe9d-47f6-85b5-cb9977de5f79] on shard 2 failed to repair 104 sub ranges), shard 3: std::runtime_error (repair id [id=1, uuid=f3dd6fcd-fe9d-47f6-85b5-cb9977de5f79] on shard 3 failed to repair 104 sub ranges), shard 4: std::runtime_error (repair id [id=1, uuid=f3dd6fcd-fe9d-47f6-85b5-cb9977de5f79] on shard 4 failed to repair 104 sub ranges), shard 5: std::runtime_error (repair id [id=1, uuid=f3dd6fcd-fe9d-47f6-85b5-cb9977de5f79] on shard 5 failed to repair 104 sub ranges), shard 6: std::runtime_error (repair id [id=1, uuid=f3dd6fcd-fe9d-47f6-85b5-cb9977de5f79] on shard 6 failed to repair 104 sub ranges), shard 7: std::runtime_error (repair id [id=1, uuid=f3dd6fcd-fe9d-47f6-85b5-cb9977de5f79] on shard 7 failed to repair 104 sub ranges), shard 8: std::runtime_error (repair id [id=1, uuid=f3dd6fcd-fe9d-47f6-85b5-cb9977de5f79] on shard 8 failed to repair 104 sub ranges), shard 9: std::runtime_error (repair id [id=1, uuid=f3dd6fcd-fe9d-47f6-85b5-cb9977de5f79] on shard 9 failed to repair 104 sub ranges), shard 10: std::runtime_error (repair id [id=1, uuid=f3dd6fcd-fe9d-47f6-85b5-cb9977de5f79] on shard 10 failed to repair 104 sub ranges), shard 11: std::runtime_error (repair id [id=1, uuid=f3dd6fcd-fe9d-47f6-85b5-cb9977de5f79] on shard 11 failed to repair 104 sub ranges), shard 12: std::runtime_error (repair id [id=1, uuid=f3dd6fcd-fe9d-47f6-85b5-cb9977de5f79] on shard 12 failed to repair 104 sub ranges), shard 13: std::runtime_error (repair id [id=1, uuid=f3dd6fcd-fe9d-47f6-85b5-cb9977de5f79] on shard 13 failed to repair 104 sub ranges)})
2021-01-25T20:08:38+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-4 !WARNING | scylla: [shard 0] repair - repair_tracker run for repair id [id=1, uuid=f3dd6fcd-fe9d-47f6-85b5-cb9977de5f79] failed: std::runtime_error ({shard 0: std::runtime_error (repair id [id=1, uuid=f3dd6fcd-fe9d-47f6-85b5-cb9977de5f79] on shard 0 failed to repair 104 sub ranges), shard 1: std::runtime_error (repair id [id=1, uuid=f3dd6fcd-fe9d-47f6-85b5-cb9977de5f79] on shard 1 failed to repair 104 sub ranges), shard 2: std::runtime_error (repair id [id=1, uuid=f3dd6fcd-fe9d-47f6-85b5-cb9977de5f79] on shard 2 failed to repair 104 sub ranges), shard 3: std::runtime_error (repair id [id=1, uuid=f3dd6fcd-fe9d-47f6-85b5-cb9977de5f79] on shard 3 failed to repair 104 sub ranges), shard 4: std::runtime_error (repair id [id=1, uuid=f3dd6fcd-fe9d-47f6-85b5-cb9977de5f79] on shard 4 failed to repair 104 sub ranges), shard 5: std::runtime_error (repair id [id=1, uuid=f3dd6fcd-fe9d-47f6-85b5-cb9977de5f79] on shard 5 failed to repair 104 sub ranges), shard 6: std::runtime_error (repair id [id=1, uuid=f3dd6fcd-fe9d-47f6-85b5-cb9977de5f79] on shard 6 failed to repair 104 sub ranges), shard 7: std::runtime_error (repair id [id=1, uuid=f3dd6fcd-fe9d-47f6-85b5-cb9977de5f79] on shard 7 failed to repair 104 sub ranges), shard 8: std::runtime_error (repair id [id=1, uuid=f3dd6fcd-fe9d-47f6-85b5-cb9977de5f79] on shard 8 failed to repair 104 sub ranges), shard 9: std::runtime_error (repair id [id=1, uuid=f3dd6fcd-fe9d-47f6-85b5-cb9977de5f79] on shard 9 failed to repair 104 sub ranges), shard 10: std::runtime_error (repair id [id=1, uuid=f3dd6fcd-fe9d-47f6-85b5-cb9977de5f79] on shard 10 failed to repair 104 sub ranges), shard 11: std::runtime_error (repair id [id=1, uuid=f3dd6fcd-fe9d-47f6-85b5-cb9977de5f79] on shard 11 failed to repair 104 sub ranges), shard 12: std::runtime_error (repair id [id=1, uuid=f3dd6fcd-fe9d-47f6-85b5-cb9977de5f79] on shard 12 failed to repair 104 sub ranges), shard 13: std::runtime_error (repair id [id=1, uuid=f3dd6fcd-fe9d-47f6-85b5-cb9977de5f79] on shard 13 failed to repair 104 sub ranges)})



< t:2021-01-25 20:08:38,890 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > 
< t:2021-01-25 20:08:38,890 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > Command: '/usr/bin/nodetool  repair '
< t:2021-01-25 20:08:38,890 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > 
< t:2021-01-25 20:08:38,890 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > Exit code: 2
< t:2021-01-25 20:08:38,890 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > 
< t:2021-01-25 20:08:38,890 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > Stdout:
< t:2021-01-25 20:08:38,890 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > 
< t:2021-01-25 20:08:38,890 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > Using /etc/scylla/scylla.yaml as the config file
< t:2021-01-25 20:08:38,890 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > [2021-01-25 20:08:34,425] Starting repair command #1, repairing 1 ranges for keyspace system_traces (parallelism=SEQUENTIAL, full=true)
< t:2021-01-25 20:08:38,890 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > [2021-01-25 20:08:38,532] Repair session 1 failed
< t:2021-01-25 20:08:38,890 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > [2021-01-25 20:08:38,533] Repair session 1 finished
< t:2021-01-25 20:08:38,890 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > 
< t:2021-01-25 20:08:38,890 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > Stderr:
< t:2021-01-25 20:08:38,890 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > 
< t:2021-01-25 20:08:38,890 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > error: Repair job has failed with the error message: [2021-01-25 20:08:38,532] Repair session 1 failed
< t:2021-01-25 20:08:38,890 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > -- StackTrace --
< t:2021-01-25 20:08:38,890 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > java.lang.RuntimeException: Repair job has failed with the error message: [2021-01-25 20:08:38,532] Repair session 1 failed
< t:2021-01-25 20:08:38,890 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR >   at org.apache.cassandra.tools.RepairRunner.progress(RepairRunner.java:124)
< t:2021-01-25 20:08:38,890 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR >   at org.apache.cassandra.utils.progress.jmx.JMXNotificationProgressListener.handleNotification(JMXNotificationProgressListener.java:77)
< t:2021-01-25 20:08:38,890 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR >   at com.sun.jmx.remote.internal.ClientNotifForwarder$NotifFetcher.dispatchNotification(ClientNotifForwarder.java:583)
< t:2021-01-25 20:08:38,890 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR >   at com.sun.jmx.remote.internal.ClientNotifForwarder$NotifFetcher.doRun(ClientNotifForwarder.java:533)
< t:2021-01-25 20:08:38,890 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR >   at com.sun.jmx.remote.internal.ClientNotifForwarder$NotifFetcher.run(ClientNotifForwarder.java:452)
< t:2021-01-25 20:08:38,890 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR >   at com.sun.jmx.remote.internal.ClientNotifForwarder$LinearExecutor$1.run(ClientNotifForwarder.java:108)

node5

SHA256:1YmYWh7roz4Y3FSY7N8CetmQNlK/K5FeBPITC8KF0u8
2021-01-25T20:15:35+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-5 !WARNING | scylla: [shard 11] repair - repair id [id=4, uuid=a0bd8824-5f51-4ec1-bf4e-19b6f4dd4afa] on shard 11 failed: std::runtime_error (Failed to repair for keyspace=cdc_test, cf=test_table_preimage_postimage, range=(-2919243402414869945, -2901690257701099699])
2021-01-25T20:15:35+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-5 !INFO    | systemd-logind: New session 694 of user scyllaadm.
2021-01-25T20:15:35+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-5 !WARNING | scylla: [shard 0] repair - repair id [id=4, uuid=a0bd8824-5f51-4ec1-bf4e-19b6f4dd4afa] failed: std::runtime_error ({shard 0: std::runtime_error (get_repair_meta: repair_meta_id 40137 for node 10.0.0.86 does not exist), shard 1: std::runtime_error (get_repair_meta: repair_meta_id 40491 for node 10.0.0.86 does not exist), shard 2: seastar::rpc::closed_error (connection is closed), shard 3: seastar::rpc::closed_error (connection is closed), shard 4: seastar::rpc::closed_error (connection is closed), shard 5: seastar::rpc::closed_error (connection is closed), shard 6: seastar::rpc::closed_error (connection is closed), shard 7: std::runtime_error (get_repair_meta: repair_meta_id 40295 for node 10.0.0.86 does not exist), shard 8: seastar::rpc::closed_error (connection is closed), shard 9: std::runtime_error (get_repair_meta: repair_meta_id 40457 for node 10.0.0.86 does not exist), shard 10: seastar::rpc::closed_error (connection is closed), shard 11: std::runtime_error (Failed to repair for keyspace=cdc_test, cf=test_table_preimage_postimage, range=(-2919243402414869945, -2901690257701099699]), shard 12: std::runtime_error (repair id [id=4, uuid=a0bd8824-5f51-4ec1-bf4e-19b6f4dd4afa] is aborted on shard 12), shard 13: std::runtime_error (get_repair_meta: repair_meta_id 40472 for node 10.0.0.86 does not exist)})
2021-01-25T20:15:35+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-5 !INFO    | systemd: Started Session 694 of user scyllaadm.
2021-01-25T20:15:35+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-5 !WARNING | scylla: [shard 0] repair - repair id [id=4, uuid=a0bd8824-5f51-4ec1-bf4e-19b6f4dd4afa] to sync data for keyspace=cdc_test, status=failed: std::runtime_error ({shard 0: std::runtime_error (get_repair_meta: repair_meta_id 40137 for node 10.0.0.86 does not exist), shard 1: std::runtime_error (get_repair_meta: repair_meta_id 40491 for node 10.0.0.86 does not exist), shard 2: seastar::rpc::closed_error (connection is closed), shard 3: seastar::rpc::closed_error (connection is closed), shard 4: seastar::rpc::closed_error (connection is closed), shard 5: seastar::rpc::closed_error (connection is closed), shard 6: seastar::rpc::closed_error (connection is closed), shard 7: std::runtime_error (get_repair_meta: repair_meta_id 40295 for node 10.0.0.86 does not exist), shard 8: seastar::rpc::closed_error (connection is closed), shard 9: std::runtime_error (get_repair_meta: repair_meta_id 40457 for node 10.0.0.86 does not exist), shard 10: seastar::rpc::closed_error (connection is closed), shard 11: std::runtime_error (Failed to repair for keyspace=cdc_test, cf=test_table_preimage_postimage, range=(-2919243402414869945, -2901690257701099699]), shard 12: std::runtime_error (repair id [id=4, uuid=a0bd8824-5f51-4ec1-bf4e-19b6f4dd4afa] is aborted on shard 12), shard 13: std::runtime_error (get_repair_meta: repair_meta_id 40472 for node 10.0.0.86 does not exist)})
2021-01-25T20:15:35+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-5 !INFO    | sshd[24488]: pam_unix(sshd:session): session opened for user centos by (uid=0)
2021-01-25T20:15:36+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-5 !INFO    | scylla: [shard 4] compaction - [Compact cdc_test.test_table_postimage e83c7660-5f49-11eb-a642-000000000003] Compacted 32 sstables to [/var/lib/scylla/data/cdc_test/


< t:2021-01-25 20:08:51,205 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > Command: '/usr/bin/nodetool  repair '
< t:2021-01-25 20:08:51,205 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > 
< t:2021-01-25 20:08:51,205 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > Exit code: 2
< t:2021-01-25 20:08:51,205 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > 
< t:2021-01-25 20:08:51,205 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > Stdout:
< t:2021-01-25 20:08:51,205 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > 
< t:2021-01-25 20:08:51,205 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > Using /etc/scylla/scylla.yaml as the config file
< t:2021-01-25 20:08:51,205 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > [2021-01-25 20:08:45,736] Starting repair command #1, repairing 1 ranges for keyspace system_auth (parallelism=SEQUENTIAL, full=true)
< t:2021-01-25 20:08:51,205 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > [2021-01-25 20:08:50,843] Repair session 1 failed
< t:2021-01-25 20:08:51,205 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > [2021-01-25 20:08:50,844] Repair session 1 finished
< t:2021-01-25 20:08:51,205 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > 
< t:2021-01-25 20:08:51,205 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > Stderr:
< t:2021-01-25 20:08:51,205 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > 
< t:2021-01-25 20:08:51,205 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > error: Repair job has failed with the error message: [2021-01-25 20:08:50,843] Repair session 1 failed
< t:2021-01-25 20:08:51,205 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > -- StackTrace --
< t:2021-01-25 20:08:51,205 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR > java.lang.RuntimeException: Repair job has failed with the error message: [2021-01-25 20:08:50,843] Repair session 1 failed
< t:2021-01-25 20:08:51,205 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR >   at org.apache.cassandra.tools.RepairRunner.progress(RepairRunner.java:124)
< t:2021-01-25 20:08:51,205 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR >   at org.apache.cassandra.utils.progress.jmx.JMXNotificationProgressListener.handleNotification(JMXNotificationProgressListener.java:77)
< t:2021-01-25 20:08:51,205 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR >   at com.sun.jmx.remote.internal.ClientNotifForwarder$NotifFetcher.dispatchNotification(ClientNotifForwarder.java:583)
< t:2021-01-25 20:08:51,205 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR >   at com.sun.jmx.remote.internal.ClientNotifForwarder$NotifFetcher.doRun(ClientNotifForwarder.java:533)
< t:2021-01-25 20:08:51,205 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR >   at com.sun.jmx.remote.internal.ClientNotifForwarder$NotifFetcher.run(ClientNotifForwarder.java:452)
< t:2021-01-25 20:08:51,205 f:nemesis.py      l:2193 c:sdcm.nemesis         p:ERROR >   at com.sun.jmx.remote.internal.ClientNotifForwarder$LinearExecutor$1.run(ClientNotifForwarder.java:108)

node6

2021-01-25T20:08:26+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-2 !WARNING | scylla: [shard 0] repair - repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] failed: std::runtime_error ({shard 0: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 0 failed to repair 113 sub ranges), shard 1: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 1 failed to repair 113 sub ranges), shard 2: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 2 failed to repair 113 sub ranges), shard 3: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 3 failed to repair 113 sub ranges), shard 4: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 4 failed to repair 113 sub ranges), shard 5: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 5 failed to repair 113 sub ranges), shard 6: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 6 failed to repair 113 sub ranges), shard 7: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 7 failed to repair 113 sub ranges), shard 8: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 8 failed to repair 113 sub ranges), shard 9: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 9 failed to repair 113 sub ranges), shard 10: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 10 failed to repair 113 sub ranges), shard 11: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 11 failed to repair 113 sub ranges), shard 12: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 12 failed to repair 113 sub ranges), shard 13: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 13 failed to repair 113 sub ranges)})
2021-01-25T20:08:26+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-2 !WARNING | scylla: [shard 0] repair - repair_tracker run for repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] failed: std::runtime_error ({shard 0: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 0 failed to repair 113 sub ranges), shard 1: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 1 failed to repair 113 sub ranges), shard 2: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 2 failed to repair 113 sub ranges), shard 3: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 3 failed to repair 113 sub ranges), shard 4: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 4 failed to repair 113 sub ranges), shard 5: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 5 failed to repair 113 sub ranges), shard 6: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 6 failed to repair 113 sub ranges), shard 7: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 7 failed to repair 113 sub ranges), shard 8: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 8 failed to repair 113 sub ranges), shard 9: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 9 failed to repair 113 sub ranges), shard 10: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 10 failed to repair 113 sub ranges), shard 11: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 11 failed to repair 113 sub ranges), shard 12: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 12 failed to repair 113 sub ranges), shard 13: std::runtime_error (repair id [id=1, uuid=7bb611a2-de1a-4acd-980e-71420af12923] on shard 13 failed to repair 113 sub ranges)})

After that on node1 the command nodetool removenode was started:

the following host_id: b31be881-3540-4785-9a6b-8ca280e9050c
< t:2021-01-25 20:09:03,532 f:remote_base.py  l:520  c:RemoteCmdRunner      p:DEBUG > Running command "/usr/bin/nodetool  removenode b31be881-3540-4785-9a6b-8ca280e9050c 

2021-01-25T20:09:08+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-1 !INFO    | scylla: [shard 0] storage_service - removenode[680d8ebf-f2bd-4e15-9a4b-f670782c86ad]: Started removenode operation, removing node=10.0.1.116, sync_nodes={10.0.2.15, 10.0.1.199, 10.0.0.86, 10.0.3.211, 10.0.0.196}, ignore_nodes=[]
2021-01-25T20:09:08+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-1 !INFO    | scylla: [shard 0] storage_service - removenode[680d8ebf-f2bd-4e15-9a4b-f670782c86ad]: Added node=10.0.1.116 as leaving node, coordinator=10.0.0.196
2021-01-25T20:09:08+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-1 !INFO    | scylla: [shard 0] storage_service - removenode[680d8ebf-f2bd-4e15-9a4b-f670782c86ad]: Started to sync data for removing node=10.0.1.116, coordinator=10.0.0.196
2021-01-25T20:09:08+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-1 !INFO    | scylla: [shard 0] repair - removenode_with_repair: started with keyspaces={system_traces, system_distributed, system_auth, cdc_test}, leaving_node=10.0.1.116
2021-01-25T20:09:08+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-1 !INFO    | scylla: [shard 0] repair - removenode_with_repair: started with keyspace=system_traces, leaving_node=10.0.1.116, nr_ranges=522
````
but this also failed with next error:

< t:2021-01-25 20:15:53,614 f:base.py l:220 c:RemoteCmdRunner p:DEBUG > nodetool: Scylla API server HTTP POST to URL '/storage_service/remove_node' failed: seastar::rpc::closed_error (connection is closed)


Of one of nodetool status request return next output:

< t:2021-01-25 20:16:03,487 f:base.py l:220 c:RemoteCmdRunner p:DEBUG > Datacenter: eu-north
< t:2021-01-25 20:16:03,489 f:base.py l:220 c:RemoteCmdRunner p:DEBUG > ====================
< t:2021-01-25 20:16:03,489 f:base.py l:220 c:RemoteCmdRunner p:DEBUG > Status=Up/Down
< t:2021-01-25 20:16:03,489 f:base.py l:220 c:RemoteCmdRunner p:DEBUG > |/ State=Normal/Leaving/Joining/Moving
< t:2021-01-25 20:16:03,489 f:base.py l:220 c:RemoteCmdRunner p:DEBUG > -- Address Load Tokens Owns Host ID Rack
< t:2021-01-25 20:16:03,502 f:base.py l:220 c:RemoteCmdRunner p:DEBUG > DN 10.0.3.211 3.06 GB 256 ? e2eac734-9427-489a-8ade-72f9ae675916 1a
< t:2021-01-25 20:16:03,509 f:base.py l:220 c:RemoteCmdRunner p:DEBUG > UN 10.0.0.196 3.78 GB 256 ? fa8c2db1-5274-44ba-b24b-0c44c0bf163b 1a
< t:2021-01-25 20:16:03,521 f:base.py l:220 c:RemoteCmdRunner p:DEBUG > DN 10.0.1.116 2.93 GB 256 ? b31be881-3540-4785-9a6b-8ca280e9050c 1a
< t:2021-01-25 20:16:03,528 f:base.py l:220 c:RemoteCmdRunner p:DEBUG > DN 10.0.0.86 3.55 GB 256 ? 74fbe6bc-3720-438d-901d-1c0f952cb76c 1a
< t:2021-01-25 20:16:03,530 f:base.py l:220 c:RemoteCmdRunner p:DEBUG > UN 10.0.1.199 3.47 GB 256 ? b36840a1-7384-4f45-a4c9-55c7d5cb4256 1a
< t:2021-01-25 20:16:03,534 f:base.py l:220 c:RemoteCmdRunner p:DEBUG > DN 10.0.2.15 3.73 GB 256 ? 3329c329-627b-4053-bb75-c85cb5a3110e 1a


Node3 stay in DN status in cluster:

< t:2021-01-25 20:18:44,528 f:base.py l:220 c:RemoteCmdRunner p:DEBUG > Datacenter: eu-north
< t:2021-01-25 20:18:44,530 f:base.py l:220 c:RemoteCmdRunner p:DEBUG > ====================
< t:2021-01-25 20:18:44,530 f:base.py l:220 c:RemoteCmdRunner p:DEBUG > Status=Up/Down
< t:2021-01-25 20:18:44,530 f:base.py l:220 c:RemoteCmdRunner p:DEBUG > |/ State=Normal/Leaving/Joining/Moving
< t:2021-01-25 20:18:44,532 f:base.py l:220 c:RemoteCmdRunner p:DEBUG > -- Address Load Tokens Owns Host ID Rack
< t:2021-01-25 20:18:44,540 f:base.py l:220 c:RemoteCmdRunner p:DEBUG > UN 10.0.3.211 1.97 GB 256 ? e2eac734-9427-489a-8ade-72f9ae675916 1a
< t:2021-01-25 20:18:44,543 f:base.py l:220 c:RemoteCmdRunner p:DEBUG > UN 10.0.0.196 3.34 GB 256 ? fa8c2db1-5274-44ba-b24b-0c44c0bf163b 1a
< t:2021-01-25 20:18:44,546 f:base.py l:220 c:RemoteCmdRunner p:DEBUG > DN 10.0.1.116 2.93 GB 256 ? b31be881-3540-4785-9a6b-8ca280e9050c 1a
< t:2021-01-25 20:18:44,551 f:base.py l:220 c:RemoteCmdRunner p:DEBUG > UN 10.0.0.86 2.74 GB 256 ? 74fbe6bc-3720-438d-901d-1c0f952cb76c 1a
< t:2021-01-25 20:18:44,554 f:base.py l:220 c:RemoteCmdRunner p:DEBUG > UN 10.0.1.199 2.95 GB 256 ? b36840a1-7384-4f45-a4c9-55c7d5cb4256 1a
< t:2021-01-25 20:18:44,560 f:base.py l:220 c:RemoteCmdRunner p:DEBUG > UN 10.0.2.15 3.49 GB 256 ? 3329c329-627b-4053-bb75-c85cb5a3110e 1a
< t:2021-01-25 20:18:44,560 f:base.py l:220 c:RemoteCmdRunner p:DEBUG >



During the repair process (by user request or during remove nodes)  on different nodes reported several seastar exception backtraces:

2021-01-25 20:13:58.000: (DatabaseLogEvent Severity.ERROR): type=BACKTRACE regex=backtrace line_number=26656 node=Node longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-1 [13.48.177.90 | 10.0.0.196] (seed: True)
2021-01-25T20:13:58+00:00 longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-1 !WARNING | scylla: [shard 9] seastar - Exceptional future ignored: seastar::rpc::closed_error (connection is closed), backtrace: 0x3c532ee#012 0x3c53760#012 0x3c53ae8#012 0x387eb67#012 0x283cefc#012 0x38c97af#012 0x38ca997#012 0x38e9058#012 0x38950fa#012 /opt/scylladb/libreloc/libpthread.so.0+0x93f8#012 /opt/scylladb/libreloc/libc.so.6+0x101902#012 --------#012 seastar::continuation, seastar::rpc::source > >, netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>
void seastar::backtrace(seastar::current_backtrace_tasklocal()::$_3&&) at ./build/release/seastar/./seastar/include/seastar/util/backtrace.hh:59
(inlined by) seastar::current_backtrace_tasklocal() at ./build/release/seastar/./seastar/src/util/backtrace.cc:86
seastar::current_tasktrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:135
seastar::current_backtrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:168
seastar::report_failed_future(std::__exception_ptr::exception_ptr const&) at ./build/release/seastar/./seastar/src/core/future.cc:210
(inlined by) seastar::report_failed_future(seastar::future_state_base::any&&) at ./build/release/seastar/./seastar/src/core/future.cc:218
seastar::future_state_base::any::check_failure() at ././seastar/include/seastar/core/future.hh:567
(inlined by) seastar::future_state >::clear() at ././seastar/include/seastar/core/future.hh:609
(inlined by) ~future_state at ././seastar/include/seastar/core/future.hh:614
(inlined by) ~future at ././seastar/include/seastar/core/future.hh:1337
(inlined by) ~ at ./message/messaging_service.cc:821
(inlined by) ~continuation at ././seastar/include/seastar/core/future.hh:750
(inlined by) seastar::continuation, seastar::rpc::source > >, seastar::future, seastar::rpc::source > > netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future, seastar::rpc::source > > seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>::run_and_dispose() at ././seastar/include/seastar/core/future.hh:771
seastar::reactor::run_tasks(seastar::reactor::task_queue&) at ./build/release/seastar/./seastar/src/core/reactor.cc:2220
(inlined by) seastar::reactor::run_some_tasks() at ./build/release/seastar/./seastar/src/core/reactor.cc:2629
seastar::reactor::run() at ./build/release/seastar/./seastar/src/core/reactor.cc:2788
operator() at ./build/release/seastar/./seastar/src/core/reactor.cc:3979
(inlined by) void std::__invoke_impl (inlined by) std::enable_if::type std::__invoke_r (inlined by) std::_Function_handler::_M_invoke(std::_Any_data const&) at /usr/lib/gcc/x86_64-redhat-linux/10/../../../../include/c++/10/bits/std_function.h:291
std::function (inlined by) seastar::posix_thread::start_routine(void) at ./build/release/seastar/./seastar/src/core/posix.cc:60
?? ??:0
?? ??:0
2021-01-25 20:13:58.000: (DatabaseLogEvent Severity.ERROR): type=BACKTRACE regex=backtrace line_number=26657 node=Node longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-1 [13.48.177.90 | 10.0.0.196] (seed: True)
2021-01-25T20:13:58+00:00 longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-1 !WARNING | scylla: [shard 9] seastar - Exceptional future ignored: seastar::rpc::closed_error (connection is closed), backtrace: 0x3c532ee#012 0x3c53760#012 0x3c53ae8#012 0x387eb67#012 0x283cefc#012 0x38c97af#012 0x38ca997#012 0x38e9058#012 0x38950fa#012 /opt/scylladb/libreloc/libpthread.so.0+0x93f8#012 /opt/scylladb/libreloc/libc.so.6+0x101902#012 --------#012 seastar::continuation, seastar::rpc::source > >, netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>
void seastar::backtrace(seastar::current_backtrace_tasklocal()::$_3&&) at ./build/release/seastar/./seastar/include/seastar/util/backtrace.hh:59
(inlined by) seastar::current_backtrace_tasklocal() at ./build/release/seastar/./seastar/src/util/backtrace.cc:86
seastar::current_tasktrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:135
seastar::current_backtrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:168
seastar::report_failed_future(std::__exception_ptr::exception_ptr const&) at ./build/release/seastar/./seastar/src/core/future.cc:210
(inlined by) seastar::report_failed_future(seastar::future_state_base::any&&) at ./build/release/seastar/./seastar/src/core/future.cc:218
seastar::future_state_base::any::check_failure() at ././seastar/include/seastar/core/future.hh:567
(inlined by) seastar::future_state >::clear() at ././seastar/include/seastar/core/future.hh:609
(inlined by) ~future_state at ././seastar/include/seastar/core/future.hh:614
(inlined by) ~future at ././seastar/include/seastar/core/future.hh:1337
(inlined by) ~ at ./message/messaging_service.cc:821
(inlined by) ~continuation at ././seastar/include/seastar/core/future.hh:750
(inlined by) seastar::continuation, seastar::rpc::source > >, seastar::future, seastar::rpc::source > > netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future, seastar::rpc::source > > seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>::run_and_dispose() at ././seastar/include/seastar/core/future.hh:771
seastar::reactor::run_tasks(seastar::reactor::task_queue&) at ./build/release/seastar/./seastar/src/core/reactor.cc:2220
(inlined by) seastar::reactor::run_some_tasks() at ./build/release/seastar/./seastar/src/core/reactor.cc:2629
seastar::reactor::run() at ./build/release/seastar/./seastar/src/core/reactor.cc:2788
operator() at ./build/release/seastar/./seastar/src/core/reactor.cc:3979
(inlined by) void std::__invoke_impl (inlined by) std::enable_if::type std::__invoke_r (inlined by) std::_Function_handler::_M_invoke(std::_Any_data const&) at /usr/lib/gcc/x86_64-redhat-linux/10/../../../../include/c++/10/bits/std_function.h:291
std::function (inlined by) seastar::posix_thread::start_routine(void
) at ./build/release/seastar/./seastar/src/core/posix.cc:60
?? ??:0
?? ??:0
2021-01-25 20:13:58.000: (DatabaseLogEvent Severity.ERROR): type=BACKTRACE regex=backtrace line_number=26668 node=Node longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-1 [13.48.177.90 | 10.0.0.196] (seed: True)
2021-01-25T20:13:58+00:00 longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-1 !WARNING | scylla: [shard 0] seastar - Exceptional future ignored: seastar::rpc::closed_error (connection is closed), backtrace: 0x3c532ee#012 0x3c53760#012 0x3c53ae8#012 0x387eb67#012 0x2879d3c#012 0x38c97af#012 0x38ca997#012 0x3868865#012 0x3867c86#012 0xdf9e9c#012 /opt/scylladb/libreloc/libc.so.6+0x281e1#012 0xdf6d8d#012 --------#012 seastar::continuation, seastar::rpc::source > >, netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>
void seastar::backtrace(seastar::current_backtrace_tasklocal()::$_3&&) at ./build/release/seastar/./seastar/include/seastar/util/backtrace.hh:59
(inlined by) seastar::current_backtrace_tasklocal() at ./build/release/seastar/./seastar/src/util/backtrace.cc:86
seastar::current_tasktrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:135
seastar::current_backtrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:168
seastar::report_failed_future(std::__exception_ptr::exception_ptr const&) at ./build/release/seastar/./seastar/src/core/future.cc:210
(inlined by) seastar::report_failed_future(seastar::future_state_base::any&&) at ./build/release/seastar/./seastar/src/core/future.cc:218
seastar::future_state_base::any::check_failure() at ././seastar/include/seastar/core/future.hh:567
(inlined by) seastar::future_state >::clear() at ././seastar/include/seastar/core/future.hh:609
(inlined by) ~future_state at ././seastar/include/seastar/core/future.hh:614
(inlined by) ~future at ././seastar/include/seastar/core/future.hh:1337
(inlined by) ~ at ./message/messaging_service.cc:821
(inlined by) ~continuation at ././seastar/include/seastar/core/future.hh:750
(inlined by) seastar::continuation, seastar::rpc::source > >, seastar::future, seastar::rpc::source > > netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future, seastar::rpc::source > > seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>::run_and_dispose() at ././seastar/include/seastar/core/future.hh:771
seastar::reactor::run_tasks(seastar::reactor::task_queue&) at ./build/release/seastar/./seastar/src/core/reactor.cc:2220
(inlined by) seastar::reactor::run_some_tasks() at ./build/release/seastar/./seastar/src/core/reactor.cc:2629
seastar::reactor::run() at ./build/release/seastar/./seastar/src/core/reactor.cc:2788
seastar::app_template::run_deprecated(int, char, std::function seastar::app_template::run(int, char, std::function ()>&&) at ./build/release/seastar/./seastar/src/core/app-template.cc:115
main at ./main.cc:483
?? ??:0
_start at ??:?
2021-01-25 20:13:58.000: (DatabaseLogEvent Severity.ERROR): type=BACKTRACE regex=backtrace line_number=26678 node=Node longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-1 [13.48.177.90 | 10.0.0.196] (seed: True)
2021-01-25T20:13:58+00:00 longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-1 !WARNING | scylla: [shard 4] seastar - Exceptional future ignored: seastar::rpc::closed_error (connection is closed), backtrace: 0x3c532ee#012 0x3c53760#012 0x3c53ae8#012 0x387eb67#012 0x283cefc#012 0x38c97af#012 0x38ca997#012 0x38e9058#012 0x38950fa#012 /opt/scylladb/libreloc/libpthread.so.0+0x93f8#012 /opt/scylladb/libreloc/libc.so.6+0x101902#012 --------#012 seastar::continuation, seastar::rpc::source > >, netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>
void seastar::backtrace(seastar::current_backtrace_tasklocal()::$_3&&) at ./build/release/seastar/./seastar/include/seastar/util/backtrace.hh:59
(inlined by) seastar::current_backtrace_tasklocal() at ./build/release/seastar/./seastar/src/util/backtrace.cc:86
seastar::current_tasktrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:135
seastar::current_backtrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:168
seastar::report_failed_future(std::__exception_ptr::exception_ptr const&) at ./build/release/seastar/./seastar/src/core/future.cc:210
(inlined by) seastar::report_failed_future(seastar::future_state_base::any&&) at ./build/release/seastar/./seastar/src/core/future.cc:218
seastar::future_state_base::any::check_failure() at ././seastar/include/seastar/core/future.hh:567
(inlined by) seastar::future_state >::clear() at ././seastar/include/seastar/core/future.hh:609
(inlined by) ~future_state at ././seastar/include/seastar/core/future.hh:614
(inlined by) ~future at ././seastar/include/seastar/core/future.hh:1337
(inlined by) ~ at ./message/messaging_service.cc:821
(inlined by) ~continuation at ././seastar/include/seastar/core/future.hh:750
(inlined by) seastar::continuation, seastar::rpc::source > >, seastar::future, seastar::rpc::source > > netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future, seastar::rpc::source > > seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>::run_and_dispose() at ././seastar/include/seastar/core/future.hh:771
seastar::reactor::run_tasks(seastar::reactor::task_queue&) at ./build/release/seastar/./seastar/src/core/reactor.cc:2220
(inlined by) seastar::reactor::run_some_tasks() at ./build/release/seastar/./seastar/src/core/reactor.cc:2629
seastar::reactor::run() at ./build/release/seastar/./seastar/src/core/reactor.cc:2788
operator() at ./build/release/seastar/./seastar/src/core/reactor.cc:3979
(inlined by) void std::__invoke_impl (inlined by) std::enable_if::type std::__invoke_r (inlined by) std::_Function_handler::_M_invoke(std::_Any_data const&) at /usr/lib/gcc/x86_64-redhat-linux/10/../../../../include/c++/10/bits/std_function.h:291
std::function (inlined by) seastar::posix_thread::start_routine(void*) at ./build/release/seastar/./seastar/src/core/posix.cc:60
?? ??:0
?? ??:0

2021-01-25T20:13:58+00:00 longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-1 !WARNING | scylla: [shard 4] seastar - Exceptional future ignored: seastar::rpc::closed_error (connection is closed), backtrace: 0x3c532ee#012 0x3c53760#012 0x3c53ae8#012 0x387eb67#012 0x283cefc#012 0x38c97af#012 0x38ca997#012 0x38e9058#012 0x38950fa#012 /opt/scylladb/libreloc/libpthread.so.0+0x93f8#012 /opt/scylladb/libreloc/libc.so.6+0x101902#012 --------#012 seastar::continuation, seastar::rpc::source > >, netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>
void seastar::backtrace(seastar::current_backtrace_tasklocal()::$_3&&) at ./build/release/seastar/./seastar/include/seastar/util/backtrace.hh:59
(inlined by) seastar::current_backtrace_tasklocal() at ./build/release/seastar/./seastar/src/util/backtrace.cc:86
seastar::current_tasktrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:135
seastar::current_backtrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:168
seastar::report_failed_future(std::__exception_ptr::exception_ptr const&) at ./build/release/seastar/./seastar/src/core/future.cc:210
(inlined by) seastar::report_failed_future(seastar::future_state_base::any&&) at ./build/release/seastar/./seastar/src/core/future.cc:218
seastar::future_state_base::any::check_failure() at ././seastar/include/seastar/core/future.hh:567
(inlined by) seastar::future_state >::clear() at ././seastar/include/seastar/core/future.hh:609
(inlined by) ~future_state at ././seastar/include/seastar/core/future.hh:614
(inlined by) ~future at ././seastar/include/seastar/core/future.hh:1337
(inlined by) ~ at ./message/messaging_service.cc:821
(inlined by) ~continuation at ././seastar/include/seastar/core/future.hh:750
(inlined by) seastar::continuation, seastar::rpc::source > >, seastar::future, seastar::rpc::source > > netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future, seastar::rpc::source > > seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>::run_and_dispose() at ././seastar/include/seastar/core/future.hh:771
seastar::reactor::run_tasks(seastar::reactor::task_queue&) at ./build/release/seastar/./seastar/src/core/reactor.cc:2220
(inlined by) seastar::reactor::run_some_tasks() at ./build/release/seastar/./seastar/src/core/reactor.cc:2629
seastar::reactor::run() at ./build/release/seastar/./seastar/src/core/reactor.cc:2788
operator() at ./build/release/seastar/./seastar/src/core/reactor.cc:3979
(inlined by) void std::__invoke_impl (inlined by) std::enable_if::type std::__invoke_r (inlined by) std::_Function_handler::_M_invoke(std::_Any_data const&) at /usr/lib/gcc/x86_64-redhat-linux/10/../../../../include/c++/10/bits/std_function.h:291
std::function (inlined by) seastar::posix_thread::start_routine(void) at ./build/release/seastar/./seastar/src/core/posix.cc:60
?? ??:0
?? ??:0
2021-01-25 20:13:58.000: (DatabaseLogEvent Severity.ERROR): type=BACKTRACE regex=backtrace line_number=26680 node=Node longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-1 [13.48.177.90 | 10.0.0.196] (seed: True)
2021-01-25T20:13:58+00:00 longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-1 !WARNING | scylla: [shard 4] seastar - Exceptional future ignored: seastar::rpc::closed_error (connection is closed), backtrace: 0x3c532ee#012 0x3c53760#012 0x3c53ae8#012 0x387eb67#012 0x2879d3c#012 0x38c97af#012 0x38ca997#012 0x38e9058#012 0x38950fa#012 /opt/scylladb/libreloc/libpthread.so.0+0x93f8#012 /opt/scylladb/libreloc/libc.so.6+0x101902#012 --------#012 seastar::continuation, seastar::rpc::source > >, netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>
void seastar::backtrace(seastar::current_backtrace_tasklocal()::$_3&&) at ./build/release/seastar/./seastar/include/seastar/util/backtrace.hh:59
(inlined by) seastar::current_backtrace_tasklocal() at ./build/release/seastar/./seastar/src/util/backtrace.cc:86
seastar::current_tasktrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:135
seastar::current_backtrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:168
seastar::report_failed_future(std::__exception_ptr::exception_ptr const&) at ./build/release/seastar/./seastar/src/core/future.cc:210
(inlined by) seastar::report_failed_future(seastar::future_state_base::any&&) at ./build/release/seastar/./seastar/src/core/future.cc:218
seastar::future_state_base::any::check_failure() at ././seastar/include/seastar/core/future.hh:567
(inlined by) seastar::future_state >::clear() at ././seastar/include/seastar/core/future.hh:609
(inlined by) ~future_state at ././seastar/include/seastar/core/future.hh:614
(inlined by) ~future at ././seastar/include/seastar/core/future.hh:1337
(inlined by) ~ at ./message/messaging_service.cc:821
(inlined by) ~continuation at ././seastar/include/seastar/core/future.hh:750
(inlined by) seastar::continuation, seastar::rpc::source > >, seastar::future, seastar::rpc::source > > netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future, seastar::rpc::source > > seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>::run_and_dispose() at ././seastar/include/seastar/core/future.hh:771
seastar::reactor::run_tasks(seastar::reactor::task_queue&) at ./build/release/seastar/./seastar/src/core/reactor.cc:2220
(inlined by) seastar::reactor::run_some_tasks() at ./build/release/seastar/./seastar/src/core/reactor.cc:2629
seastar::reactor::run() at ./build/release/seastar/./seastar/src/core/reactor.cc:2788
operator() at ./build/release/seastar/./seastar/src/core/reactor.cc:3979
(inlined by) void std::__invoke_impl (inlined by) std::enable_if::type std::__invoke_r (inlined by) std::_Function_handler::_M_invoke(std::_Any_data const&) at /usr/lib/gcc/x86_64-redhat-linux/10/../../../../include/c++/10/bits/std_function.h:291
std::function (inlined by) seastar::posix_thread::start_routine(void
) at ./build/release/seastar/./seastar/src/core/posix.cc:60
?? ??:0
?? ??:0
2021-01-25 20:13:58.000: (DatabaseLogEvent Severity.ERROR): type=BACKTRACE regex=backtrace line_number=26681 node=Node longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-1 [13.48.177.90 | 10.0.0.196] (seed: True)
2021-01-25T20:13:58+00:00 longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-1 !WARNING | scylla: [shard 4] seastar - Exceptional future ignored: seastar::rpc::closed_error (connection is closed), backtrace: 0x3c532ee#012 0x3c53760#012 0x3c53ae8#012 0x387eb67#012 0x283cefc#012 0x38c97af#012 0x38ca997#012 0x38e9058#012 0x38950fa#012 /opt/scylladb/libreloc/libpthread.so.0+0x93f8#012 /opt/scylladb/libreloc/libc.so.6+0x101902#012 --------#012 seastar::continuation, seastar::rpc::source > >, netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>
void seastar::backtrace(seastar::current_backtrace_tasklocal()::$_3&&) at ./build/release/seastar/./seastar/include/seastar/util/backtrace.hh:59
(inlined by) seastar::current_backtrace_tasklocal() at ./build/release/seastar/./seastar/src/util/backtrace.cc:86
seastar::current_tasktrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:135
seastar::current_backtrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:168
seastar::report_failed_future(std::__exception_ptr::exception_ptr const&) at ./build/release/seastar/./seastar/src/core/future.cc:210
(inlined by) seastar::report_failed_future(seastar::future_state_base::any&&) at ./build/release/seastar/./seastar/src/core/future.cc:218
seastar::future_state_base::any::check_failure() at ././seastar/include/seastar/core/future.hh:567
(inlined by) seastar::future_state >::clear() at ././seastar/include/seastar/core/future.hh:609
(inlined by) ~future_state at ././seastar/include/seastar/core/future.hh:614
(inlined by) ~future at ././seastar/include/seastar/core/future.hh:1337
(inlined by) ~ at ./message/messaging_service.cc:821
(inlined by) ~continuation at ././seastar/include/seastar/core/future.hh:750
(inlined by) seastar::continuation, seastar::rpc::source > >, seastar::future, seastar::rpc::source > > netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future, seastar::rpc::source > > seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>::run_and_dispose() at ././seastar/include/seastar/core/future.hh:771
seastar::reactor::run_tasks(seastar::reactor::task_queue&) at ./build/release/seastar/./seastar/src/core/reactor.cc:2220
(inlined by) seastar::reactor::run_some_tasks() at ./build/release/seastar/./seastar/src/core/reactor.cc:2629
seastar::reactor::run() at ./build/release/seastar/./seastar/src/core/reactor.cc:2788
operator() at ./build/release/seastar/./seastar/src/core/reactor.cc:3979
(inlined by) void std::__invoke_impl (inlined by) std::enable_if::type std::__invoke_r (inlined by) std::_Function_handler::_M_invoke(std::_Any_data const&) at /usr/lib/gcc/x86_64-redhat-linux/10/../../../../include/c++/10/bits/std_function.h:291
std::function (inlined by) seastar::posix_thread::start_routine(void) at ./build/release/seastar/./seastar/src/core/posix.cc:60
?? ??:0
?? ??:0
2021-01-25 20:13:58.000: (DatabaseLogEvent Severity.ERROR): type=BACKTRACE regex=backtrace line_number=26682 node=Node longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-1 [13.48.177.90 | 10.0.0.196] (seed: True)
2021-01-25T20:13:58+00:00 longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-1 !WARNING | scylla: [shard 4] seastar - Exceptional future ignored: seastar::rpc::closed_error (connection is closed), backtrace: 0x3c532ee#012 0x3c53760#012 0x3c53ae8#012 0x387eb67#012 0x283cefc#012 0x38c97af#012 0x38ca997#012 0x38e9058#012 0x38950fa#012 /opt/scylladb/libreloc/libpthread.so.0+0x93f8#012 /opt/scylladb/libreloc/libc.so.6+0x101902#012 --------#012 seastar::continuation, seastar::rpc::source > >, netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>
void seastar::backtrace(seastar::current_backtrace_tasklocal()::$_3&&) at ./build/release/seastar/./seastar/include/seastar/util/backtrace.hh:59
(inlined by) seastar::current_backtrace_tasklocal() at ./build/release/seastar/./seastar/src/util/backtrace.cc:86
seastar::current_tasktrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:135
seastar::current_backtrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:168
seastar::report_failed_future(std::__exception_ptr::exception_ptr const&) at ./build/release/seastar/./seastar/src/core/future.cc:210
(inlined by) seastar::report_failed_future(seastar::future_state_base::any&&) at ./build/release/seastar/./seastar/src/core/future.cc:218
seastar::future_state_base::any::check_failure() at ././seastar/include/seastar/core/future.hh:567
(inlined by) seastar::future_state >::clear() at ././seastar/include/seastar/core/future.hh:609
(inlined by) ~future_state at ././seastar/include/seastar/core/future.hh:614
(inlined by) ~future at ././seastar/include/seastar/core/future.hh:1337
(inlined by) ~ at ./message/messaging_service.cc:821
(inlined by) ~continuation at ././seastar/include/seastar/core/future.hh:750
(inlined by) seastar::continuation, seastar::rpc::source > >, seastar::future, seastar::rpc::source > > netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future, seastar::rpc::source > > seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>::run_and_dispose() at ././seastar/include/seastar/core/future.hh:771
seastar::reactor::run_tasks(seastar::reactor::task_queue&) at ./build/release/seastar/./seastar/src/core/reactor.cc:2220
(inlined by) seastar::reactor::run_some_tasks() at ./build/release/seastar/./seastar/src/core/reactor.cc:2629
seastar::reactor::run() at ./build/release/seastar/./seastar/src/core/reactor.cc:2788
operator() at ./build/release/seastar/./seastar/src/core/reactor.cc:3979
(inlined by) void std::__invoke_impl (inlined by) std::enable_if::type std::__invoke_r (inlined by) std::_Function_handler::_M_invoke(std::_Any_data const&) at /usr/lib/gcc/x86_64-redhat-linux/10/../../../../include/c++/10/bits/std_function.h:291
std::function (inlined by) seastar::posix_thread::start_routine(void
) at ./build/release/seastar/./seastar/src/core/posix.cc:60
?? ??:0
?? ??:0
2021-01-25 20:13:58.000: (DatabaseLogEvent Severity.ERROR): type=BACKTRACE regex=backtrace line_number=26683 node=Node longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-1 [13.48.177.90 | 10.0.0.196] (seed: True)
2021-01-25T20:13:58+00:00 longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-1 !WARNING | scylla: [shard 4] seastar - Exceptional future ignored: seastar::rpc::closed_error (connection is closed), backtrace: 0x3c532ee#012 0x3c53760#012 0x3c53ae8#012 0x387eb67#012 0x283cefc#012 0x38c97af#012 0x38ca997#012 0x38e9058#012 0x38950fa#012 /opt/scylladb/libreloc/libpthread.so.0+0x93f8#012 /opt/scylladb/libreloc/libc.so.6+0x101902#012 --------#012 seastar::continuation, seastar::rpc::source > >, netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>
void seastar::backtrace(seastar::current_backtrace_tasklocal()::$_3&&) at ./build/release/seastar/./seastar/include/seastar/util/backtrace.hh:59
(inlined by) seastar::current_backtrace_tasklocal() at ./build/release/seastar/./seastar/src/util/backtrace.cc:86
seastar::current_tasktrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:135
seastar::current_backtrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:168
seastar::report_failed_future(std::__exception_ptr::exception_ptr const&) at ./build/release/seastar/./seastar/src/core/future.cc:210
(inlined by) seastar::report_failed_future(seastar::future_state_base::any&&) at ./build/release/seastar/./seastar/src/core/future.cc:218
seastar::future_state_base::any::check_failure() at ././seastar/include/seastar/core/future.hh:567
(inlined by) seastar::future_state >::clear() at ././seastar/include/seastar/core/future.hh:609
(inlined by) ~future_state at ././seastar/include/seastar/core/future.hh:614
(inlined by) ~future at ././seastar/include/seastar/core/future.hh:1337
(inlined by) ~ at ./message/messaging_service.cc:821
(inlined by) ~continuation at ././seastar/include/seastar/core/future.hh:750
(inlined by) seastar::continuation, seastar::rpc::source > >, seastar::future, seastar::rpc::source > > netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future, seastar::rpc::source > > seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>::run_and_dispose() at ././seastar/include/seastar/core/future.hh:771
seastar::reactor::run_tasks(seastar::reactor::task_queue&) at ./build/release/seastar/./seastar/src/core/reactor.cc:2220
(inlined by) seastar::reactor::run_some_tasks() at ./build/release/seastar/./seastar/src/core/reactor.cc:2629
seastar::reactor::run() at ./build/release/seastar/./seastar/src/core/reactor.cc:2788
operator() at ./build/release/seastar/./seastar/src/core/reactor.cc:3979
(inlined by) void std::__invoke_impl (inlined by) std::enable_if::type std::__invoke_r (inlined by) std::_Function_handler::_M_invoke(std::_Any_data const&) at /usr/lib/gcc/x86_64-redhat-linux/10/../../../../include/c++/10/bits/std_function.h:291
std::function (inlined by) seastar::posix_thread::start_routine(void) at ./build/release/seastar/./seastar/src/core/posix.cc:60
?? ??:0
?? ??:0
2021-01-25 20:14:05.000: (DatabaseLogEvent Severity.ERROR): type=RUNTIME_ERROR regex=std::runtime_error line_number=26804 node=Node longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-1 [13.48.177.90 | 10.0.0.196] (seed: True)
2021-01-25T20:14:05+00:00 longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-1 !WARNING | scylla: [shard 3] repair - repair id [id=6, uuid=cb03e36d-9911-43f6-b899-a401732ed6cd] on shard 3, keyspace=cdc_test, cf=test_table_postimage, range=(1838719016667512753, 1844477811865142710], got error in row level repair: std::runtime_error (get_repair_meta: repair_meta_id 80249 for node 10.0.0.196 does not exist)
void seastar::backtrace(seastar::current_backtrace_tasklocal()::$_3&&) at ./build/release/seastar/./seastar/include/seastar/util/backtrace.hh:59
(inlined by) seastar::current_backtrace_tasklocal() at ./build/release/seastar/./seastar/src/util/backtrace.cc:86
seastar::current_tasktrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:135
seastar::current_backtrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:168
seastar::report_failed_future(std::__exception_ptr::exception_ptr const&) at ./build/release/seastar/./seastar/src/core/future.cc:210
(inlined by) seastar::report_failed_future(seastar::future_state_base::any&&) at ./build/release/seastar/./seastar/src/core/future.cc:218
seastar::future_state_base::any::check_failure() at ././seastar/include/seastar/core/future.hh:567
(inlined by) seastar::future_state::clear() at ././seastar/include/seastar/core/future.hh:609
(inlined by) ~future_state at ././seastar/include/seastar/core/future.hh:614
(inlined by) ~future at ././seastar/include/seastar/core/scheduling.hh:42
(inlined by) repair_meta::get_row_diff_with_rpc_stream(absl::btree_set, std::allocator >, seastar::bool_class, seastar::bool_class, gms::inet_address, unsigned int) at ./repair/row_level.cc:1887
row_level_repair::get_missing_rows_from_follower_nodes(repair_meta&) at ./repair/row_level.cc:2683
operator() at ./repair/row_level.cc:2841
void std::__invoke_impl (inlined by) std::__invoke_result (inlined by) _ZSt12__apply_implIZN16row_level_repair3runEvEUlvE_St5tupleIJEEJEEDcOT_OT0_St16integer_sequenceImJXspT1_EEE at /usr/lib/gcc/x86_64-redhat-linux/10/../../../../include/c++/10/tuple:1723
(inlined by) _ZSt5applyIZN16row_level_repair3runEvEUlvE_St5tupleIJEEEDcOT_OT0_ at /usr/lib/gcc/x86_64-redhat-linux/10/../../../../include/c++/10/tuple:1734
(inlined by) seastar::future seastar::futurize::apply (inlined by) operator() at ././seastar/include/seastar/core/thread.hh:258
(inlined by) seastar::noncopyable_function::type seastar::async) at ././seastar/include/seastar/util/noncopyable_function.hh:124
seastar::noncopyable_function (inlined by) seastar::thread_context::main() at ./build/release/seastar/./seastar/src/core/thread.cc:297
2021-01-25 20:14:13.000: (DatabaseLogEvent Severity.WARNING): type=SUPPRESSED_MESSAGES regex=journal: Suppressed line_number=18072 node=Node longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-5 [13.51.64.58 | 10.0.0.86] (seed: False)
2021-01-25T20:14:13+00:00 longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-5 !INFO | journal: Suppressed 1745 messages from /scylla.slice/scylla-server.slice
void seastar::backtrace(seastar::current_backtrace_tasklocal()::$_3&&) at ./build/release/seastar/./seastar/include/seastar/util/backtrace.hh:59
(inlined by) seastar::current_backtrace_tasklocal() at ./build/release/seastar/./seastar/src/util/backtrace.cc:86
seastar::current_tasktrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:135
seastar::current_backtrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:168
seastar::report_failed_future(std::__exception_ptr::exception_ptr const&) at ./build/release/seastar/./seastar/src/core/future.cc:210
(inlined by) seastar::report_failed_future(seastar::future_state_base::any&&) at ./build/release/seastar/./seastar/src/core/future.cc:218
seastar::future_state_base::any::check_failure() at ././seastar/include/seastar/core/future.hh:567
(inlined by) seastar::future_state >::clear() at ././seastar/include/seastar/core/future.hh:609
(inlined by) ~future_state at ././seastar/include/seastar/core/future.hh:614
(inlined by) ~future at ././seastar/include/seastar/core/future.hh:1337
(inlined by) ~ at ./message/messaging_service.cc:821
(inlined by) ~continuation at ././seastar/include/seastar/core/future.hh:750
(inlined by) seastar::continuation, seastar::rpc::source > >, seastar::future, seastar::rpc::source > > netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future, seastar::rpc::source > > seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>::run_and_dispose() at ././seastar/include/seastar/core/future.hh:771
seastar::reactor::run_tasks(seastar::reactor::task_queue&) at ./build/release/seastar/./seastar/src/core/reactor.cc:2220
(inlined by) seastar::reactor::run_some_tasks() at ./build/release/seastar/./seastar/src/core/reactor.cc:2629
seastar::reactor::run() at ./build/release/seastar/./seastar/src/core/reactor.cc:2788
operator() at ./build/release/seastar/./seastar/src/core/reactor.cc:3979
(inlined by) void std::__invoke_impl (inlined by) std::enable_if::type std::__invoke_r (inlined by) std::_Function_handler::_M_invoke(std::_Any_data const&) at /usr/lib/gcc/x86_64-redhat-linux/10/../../../../include/c++/10/bits/std_function.h:291
std::function (inlined by) seastar::posix_thread::start_routine(void) at ./build/release/seastar/./seastar/src/core/posix.cc:60
?? ??:0
?? ??:0
2021-01-25 20:17:20.254: (CassandraStressLogEvent Severity.ERROR): type=ConsistencyError regex=Cannot achieve consistency level line_number=20177 node=Node longevity-cdc-100gb-4h-4-4-loader-node-a92a84b6-2 [13.48.67.201 | 10.0.0.240] (seed: False)
20:17:20.254 [cluster1-nio-worker-0] DEBUG com.datastax.driver.core.Connection - Connection[/10.0.0.196:9042-16, inFlight=1, closed=false] Response received on stream 19328 but no handler set anymore (either the request has timed out or it was closed due to another error). Received message is ERROR UNAVAILABLE: Cannot achieve consistency level for cl QUORUM. Requires 2, alive 1
2021-01-25 20:14:15.000: (DatabaseLogEvent Severity.ERROR): type=BACKTRACE regex=backtrace line_number=18108 node=Node longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-5 [13.51.64.58 | 10.0.0.86] (seed: False)
2021-01-25T20:14:15+00:00 longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-5 !WARNING | scylla: [shard 8] seastar - Exceptional future ignored: seastar::rpc::closed_error (connection is closed), backtrace: 0x3c532ee#012 0x3c53760#012 0x3c53ae8#012 0x387eb67#012 0x283cefc#012 0x38c97af#012 0x38ca997#012 0x38e9058#012 0x38950fa#012 /opt/scylladb/libreloc/libpthread.so.0+0x93f8#012 /opt/scylladb/libreloc/libc.so.6+0x101902#012 --------#012 seastar::continuation, seastar::rpc::source > >, netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>
void seastar::backtrace(seastar::current_backtrace_tasklocal()::$_3&&) at ./build/release/seastar/./seastar/include/seastar/util/backtrace.hh:59
(inlined by) seastar::current_backtrace_tasklocal() at ./build/release/seastar/./seastar/src/util/backtrace.cc:86
seastar::current_tasktrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:135
seastar::current_backtrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:168
seastar::report_failed_future(std::__exception_ptr::exception_ptr const&) at ./build/release/seastar/./seastar/src/core/future.cc:210
(inlined by) seastar::report_failed_future(seastar::future_state_base::any&&) at ./build/release/seastar/./seastar/src/core/future.cc:218
seastar::future_state_base::any::check_failure() at ././seastar/include/seastar/core/future.hh:567
(inlined by) seastar::future_state >::clear() at ././seastar/include/seastar/core/future.hh:609
(inlined by) ~future_state at ././seastar/include/seastar/core/future.hh:614
(inlined by) ~future at ././seastar/include/seastar/core/future.hh:1337
(inlined by) ~ at ./message/messaging_service.cc:821
(inlined by) ~continuation at ././seastar/include/seastar/core/future.hh:750
(inlined by) seastar::continuation, seastar::rpc::source > >, seastar::future, seastar::rpc::source > > netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future, seastar::rpc::source > > seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>::run_and_dispose() at ././seastar/include/seastar/core/future.hh:771
seastar::reactor::run_tasks(seastar::reactor::task_queue&) at ./build/release/seastar/./seastar/src/core/reactor.cc:2220
(inlined by) seastar::reactor::run_some_tasks() at ./build/release/seastar/./seastar/src/core/reactor.cc:2629
seastar::reactor::run() at ./build/release/seastar/./seastar/src/core/reactor.cc:2788
operator() at ./build/release/seastar/./seastar/src/core/reactor.cc:3979
(inlined by) void std::__invoke_impl (inlined by) std::enable_if::type std::__invoke_r (inlined by) std::_Function_handler::_M_invoke(std::_Any_data const&) at /usr/lib/gcc/x86_64-redhat-linux/10/../../../../include/c++/10/bits/std_function.h:291
std::function (inlined by) seastar::posix_thread::start_routine(void
) at ./build/release/seastar/./seastar/src/core/posix.cc:60
?? ??:0
?? ??:0
2021-01-25 20:14:16.000: (DatabaseLogEvent Severity.ERROR): type=BACKTRACE regex=backtrace line_number=18123 node=Node longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-5 [13.51.64.58 | 10.0.0.86] (seed: False)
2021-01-25T20:14:16+00:00 longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-5 !WARNING | scylla: [shard 6] seastar - Exceptional future ignored: seastar::rpc::closed_error (connection is closed), backtrace: 0x3c532ee#012 0x3c53760#012 0x3c53ae8#012 0x387eb67#012 0x2879d3c#012 0x38c97af#012 0x38ca997#012 0x38e9058#012 0x38950fa#012 /opt/scylladb/libreloc/libpthread.so.0+0x93f8#012 /opt/scylladb/libreloc/libc.so.6+0x101902#012 --------#012 seastar::continuation, seastar::rpc::source > >, netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>
void seastar::backtrace(seastar::current_backtrace_tasklocal()::$_3&&) at ./build/release/seastar/./seastar/include/seastar/util/backtrace.hh:59
(inlined by) seastar::current_backtrace_tasklocal() at ./build/release/seastar/./seastar/src/util/backtrace.cc:86
seastar::current_tasktrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:135
seastar::current_backtrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:168
seastar::report_failed_future(std::__exception_ptr::exception_ptr const&) at ./build/release/seastar/./seastar/src/core/future.cc:210
(inlined by) seastar::report_failed_future(seastar::future_state_base::any&&) at ./build/release/seastar/./seastar/src/core/future.cc:218
seastar::future_state_base::any::check_failure() at ././seastar/include/seastar/core/future.hh:567
(inlined by) seastar::future_state >::clear() at ././seastar/include/seastar/core/future.hh:609
(inlined by) ~future_state at ././seastar/include/seastar/core/future.hh:614
(inlined by) ~future at ././seastar/include/seastar/core/future.hh:1337
(inlined by) ~ at ./message/messaging_service.cc:821
(inlined by) ~continuation at ././seastar/include/seastar/core/future.hh:750
(inlined by) seastar::continuation, seastar::rpc::source > >, seastar::future, seastar::rpc::source > > netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future, seastar::rpc::source > > seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>::run_and_dispose() at ././seastar/include/seastar/core/future.hh:771
seastar::reactor::run_tasks(seastar::reactor::task_queue&) at ./build/release/seastar/./seastar/src/core/reactor.cc:2220
(inlined by) seastar::reactor::run_some_tasks() at ./build/release/seastar/./seastar/src/core/reactor.cc:2629
seastar::reactor::run() at ./build/release/seastar/./seastar/src/core/reactor.cc:2788
operator() at ./build/release/seastar/./seastar/src/core/reactor.cc:3979
(inlined by) void std::__invoke_impl (inlined by) std::enable_if::type std::__invoke_r (inlined by) std::_Function_handler::_M_invoke(std::_Any_data const&) at /usr/lib/gcc/x86_64-redhat-linux/10/../../../../include/c++/10/bits/std_function.h:291
std::function (inlined by) seastar::posix_thread::start_routine(void) at ./build/release/seastar/./seastar/src/core/posix.cc:60
?? ??:0
?? ??:0
2021-01-25 20:14:12.000: (DatabaseLogEvent Severity.ERROR): type=RUNTIME_ERROR regex=std::runtime_error line_number=25142 node=Node longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-2 [13.48.71.151 | 10.0.3.211] (seed: False)
2021-01-25T20:14:12+00:00 longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-2 !WARNING | scylla: [shard 1] repair - repair id [id=5, uuid=7cebbbc8-97ec-4488-9a1f-dfcfd509534a] on shard 1, keyspace=cdc_test, cf=test_table_postimage_scylla_cdc_log, range=(-3654222017973422949, -3645643230408267529], got error in row level repair: std::runtime_error (repair id [id=5, uuid=7cebbbc8-97ec-4488-9a1f-dfcfd509534a] is aborted on shard 1)
void seastar::backtrace(seastar::current_backtrace_tasklocal()::$_3&&) at ./build/release/seastar/./seastar/include/seastar/util/backtrace.hh:59
(inlined by) seastar::current_backtrace_tasklocal() at ./build/release/seastar/./seastar/src/util/backtrace.cc:86
seastar::current_tasktrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:135
seastar::current_backtrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:168
seastar::report_failed_future(std::__exception_ptr::exception_ptr const&) at ./build/release/seastar/./seastar/src/core/future.cc:210
(inlined by) seastar::report_failed_future(seastar::future_state_base::any&&) at ./build/release/seastar/./seastar/src/core/future.cc:218
seastar::future_state_base::any::check_failure() at ././seastar/include/seastar/core/future.hh:567
(inlined by) seastar::future_state >::clear() at ././seastar/include/seastar/core/future.hh:609
(inlined by) ~future_state at ././seastar/include/seastar/core/future.hh:614
(inlined by) ~future at ././seastar/include/seastar/core/future.hh:1337
(inlined by) ~ at ./message/messaging_service.cc:821
(inlined by) ~continuation at ././seastar/include/seastar/core/future.hh:750
(inlined by) seastar::continuation, seastar::rpc::source > >, seastar::future, seastar::rpc::source > > netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future, seastar::rpc::source > > seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>::run_and_dispose() at ././seastar/include/seastar/core/future.hh:771
seastar::reactor::run_tasks(seastar::reactor::task_queue&) at ./build/release/seastar/./seastar/src/core/reactor.cc:2220
(inlined by) seastar::reactor::run_some_tasks() at ./build/release/seastar/./seastar/src/core/reactor.cc:2629
seastar::reactor::run() at ./build/release/seastar/./seastar/src/core/reactor.cc:2788
operator() at ./build/release/seastar/./seastar/src/core/reactor.cc:3979
(inlined by) void std::__invoke_impl (inlined by) std::enable_if::type std::__invoke_r (inlined by) std::_Function_handler::_M_invoke(std::_Any_data const&) at /usr/lib/gcc/x86_64-redhat-linux/10/../../../../include/c++/10/bits/std_function.h:291
std::function (inlined by) seastar::posix_thread::start_routine(void
) at ./build/release/seastar/./seastar/src/core/posix.cc:60
?? ??:0
?? ??:0
2021-01-25 20:14:25.000: (DatabaseLogEvent Severity.ERROR): type=RUNTIME_ERROR regex=std::runtime_error line_number=27146 node=Node longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-1 [13.48.177.90 | 10.0.0.196] (seed: True)
2021-01-25T20:14:25+00:00 longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-1 !WARNING | scylla: [shard 11] repair - repair id [id=6, uuid=cb03e36d-9911-43f6-b899-a401732ed6cd] on shard 11, keyspace=cdc_test, cf=test_table_preimage, range=(3184677132060271035, 3191470747596256232], got error in row level repair: std::runtime_error (repair id [id=6, uuid=cb03e36d-9911-43f6-b899-a401732ed6cd] is aborted on shard 11)
void seastar::backtrace(seastar::current_backtrace_tasklocal()::$_3&&) at ./build/release/seastar/./seastar/include/seastar/util/backtrace.hh:59
(inlined by) seastar::current_backtrace_tasklocal() at ./build/release/seastar/./seastar/src/util/backtrace.cc:86
seastar::current_tasktrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:135
seastar::current_backtrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:168
seastar::report_failed_future(std::__exception_ptr::exception_ptr const&) at ./build/release/seastar/./seastar/src/core/future.cc:210
(inlined by) seastar::report_failed_future(seastar::future_state_base::any&&) at ./build/release/seastar/./seastar/src/core/future.cc:218
seastar::future_state_base::any::check_failure() at ././seastar/include/seastar/core/future.hh:567
(inlined by) seastar::future_state >::clear() at ././seastar/include/seastar/core/future.hh:609
(inlined by) ~future_state at ././seastar/include/seastar/core/future.hh:614
(inlined by) ~future at ././seastar/include/seastar/core/future.hh:1337
(inlined by) ~ at ./message/messaging_service.cc:821
(inlined by) ~continuation at ././seastar/include/seastar/core/future.hh:750
(inlined by) seastar::continuation, seastar::rpc::source > >, seastar::future, seastar::rpc::source > > netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future, seastar::rpc::source > > seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>::run_and_dispose() at ././seastar/include/seastar/core/future.hh:771
seastar::reactor::run_tasks(seastar::reactor::task_queue&) at ./build/release/seastar/./seastar/src/core/reactor.cc:2220
(inlined by) seastar::reactor::run_some_tasks() at ./build/release/seastar/./seastar/src/core/reactor.cc:2629
seastar::reactor::run() at ./build/release/seastar/./seastar/src/core/reactor.cc:2788
operator() at ./build/release/seastar/./seastar/src/core/reactor.cc:3979
(inlined by) void std::__invoke_impl (inlined by) std::enable_if::type std::__invoke_r (inlined by) std::_Function_handler::_M_invoke(std::_Any_data const&) at /usr/lib/gcc/x86_64-redhat-linux/10/../../../../include/c++/10/bits/std_function.h:291
std::function (inlined by) seastar::posix_thread::start_routine(void) at ./build/release/seastar/./seastar/src/core/posix.cc:60
?? ??:0
?? ??:0
2021-01-25 20:14:19.000: (DatabaseLogEvent Severity.ERROR): type=BACKTRACE regex=backtrace line_number=12410 node=Node longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-6 [13.49.225.189 | 10.0.2.15] (seed: False)
2021-01-25T20:14:19+00:00 longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-6 !WARNING | scylla: [shard 5] seastar - Exceptional future ignored: seastar::rpc::closed_error (connection is closed), backtrace: 0x3c532ee#012 0x3c53760#012 0x3c53ae8#012 0x387eb67#012 0x2879d3c#012 0x38c97af#012 0x38ca997#012 0x38e9058#012 0x38950fa#012 /opt/scylladb/libreloc/libpthread.so.0+0x93f8#012 /opt/scylladb/libreloc/libc.so.6+0x101902#012 --------#012 seastar::continuation, seastar::rpc::source > >, netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>
void seastar::backtrace(seastar::current_backtrace_tasklocal()::$_3&&) at ./build/release/seastar/./seastar/include/seastar/util/backtrace.hh:59
(inlined by) seastar::current_backtrace_tasklocal() at ./build/release/seastar/./seastar/src/util/backtrace.cc:86
seastar::current_tasktrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:135
seastar::current_backtrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:168
seastar::report_failed_future(std::__exception_ptr::exception_ptr const&) at ./build/release/seastar/./seastar/src/core/future.cc:210
(inlined by) seastar::report_failed_future(seastar::future_state_base::any&&) at ./build/release/seastar/./seastar/src/core/future.cc:218
seastar::future_state_base::any::check_failure() at ././seastar/include/seastar/core/future.hh:567
(inlined by) seastar::future_state >::clear() at ././seastar/include/seastar/core/future.hh:609
(inlined by) ~future_state at ././seastar/include/seastar/core/future.hh:614
(inlined by) ~future at ././seastar/include/seastar/core/future.hh:1337
(inlined by) ~ at ./message/messaging_service.cc:821
(inlined by) ~continuation at ././seastar/include/seastar/core/future.hh:750
(inlined by) seastar::continuation, seastar::rpc::source > >, seastar::future, seastar::rpc::source > > netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future, seastar::rpc::source > > seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>::run_and_dispose() at ././seastar/include/seastar/core/future.hh:771
seastar::reactor::run_tasks(seastar::reactor::task_queue&) at ./build/release/seastar/./seastar/src/core/reactor.cc:2220
(inlined by) seastar::reactor::run_some_tasks() at ./build/release/seastar/./seastar/src/core/reactor.cc:2629
seastar::reactor::run() at ./build/release/seastar/./seastar/src/core/reactor.cc:2788
operator() at ./build/release/seastar/./seastar/src/core/reactor.cc:3979
(inlined by) void std::__invoke_impl (inlined by) std::enable_if::type std::__invoke_r (inlined by) std::_Function_handler::_M_invoke(std::_Any_data const&) at /usr/lib/gcc/x86_64-redhat-linux/10/../../../../include/c++/10/bits/std_function.h:291
std::function (inlined by) seastar::posix_thread::start_routine(void
) at ./build/release/seastar/./seastar/src/core/posix.cc:60
?? ??:0
?? ??:0
2021-01-25 20:14:19.000: (DatabaseLogEvent Severity.ERROR): type=BACKTRACE regex=backtrace line_number=12411 node=Node longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-6 [13.49.225.189 | 10.0.2.15] (seed: False)
2021-01-25T20:14:19+00:00 longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-6 !WARNING | scylla: [shard 5] seastar - Exceptional future ignored: seastar::rpc::closed_error (connection is closed), backtrace: 0x3c532ee#012 0x3c53760#012 0x3c53ae8#012 0x387eb67#012 0x285f70c#012 0x38c97af#012 0x38ca997#012 0x38e9058#012 0x38950fa#012 /opt/scylladb/libreloc/libpthread.so.0+0x93f8#012 /opt/scylladb/libreloc/libc.so.6+0x101902#012 --------#012 seastar::continuation, seastar::rpc::source > >, netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>
void seastar::backtrace(seastar::current_backtrace_tasklocal()::$_3&&) at ./build/release/seastar/./seastar/include/seastar/util/backtrace.hh:59
(inlined by) seastar::current_backtrace_tasklocal() at ./build/release/seastar/./seastar/src/util/backtrace.cc:86
seastar::current_tasktrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:135
seastar::current_backtrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:168
seastar::report_failed_future(std::__exception_ptr::exception_ptr const&) at ./build/release/seastar/./seastar/src/core/future.cc:210
(inlined by) seastar::report_failed_future(seastar::future_state_base::any&&) at ./build/release/seastar/./seastar/src/core/future.cc:218
seastar::future_state_base::any::check_failure() at ././seastar/include/seastar/core/future.hh:567
(inlined by) seastar::future_state >::clear() at ././seastar/include/seastar/core/future.hh:609
(inlined by) ~future_state at ././seastar/include/seastar/core/future.hh:614
(inlined by) ~future at ././seastar/include/seastar/core/future.hh:1337
(inlined by) ~ at ./message/messaging_service.cc:821
(inlined by) ~continuation at ././seastar/include/seastar/core/future.hh:750
(inlined by) seastar::continuation, seastar::rpc::source > >, seastar::future, seastar::rpc::source > > netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future, seastar::rpc::source > > seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>::run_and_dispose() at ././seastar/include/seastar/core/future.hh:771
seastar::reactor::run_tasks(seastar::reactor::task_queue&) at ./build/release/seastar/./seastar/src/core/reactor.cc:2220
(inlined by) seastar::reactor::run_some_tasks() at ./build/release/seastar/./seastar/src/core/reactor.cc:2629
seastar::reactor::run() at ./build/release/seastar/./seastar/src/core/reactor.cc:2788
operator() at ./build/release/seastar/./seastar/src/core/reactor.cc:3979
(inlined by) void std::__invoke_impl (inlined by) std::enable_if::type std::__invoke_r (inlined by) std::_Function_handler::_M_invoke(std::_Any_data const&) at /usr/lib/gcc/x86_64-redhat-linux/10/../../../../include/c++/10/bits/std_function.h:291
std::function (inlined by) seastar::posix_thread::start_routine(void) at ./build/release/seastar/./seastar/src/core/posix.cc:60
?? ??:0
?? ??:0
2021-01-25 20:14:19.000: (DatabaseLogEvent Severity.ERROR): type=RUNTIME_ERROR regex=std::runtime_error line_number=12421 node=Node longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-6 [13.49.225.189 | 10.0.2.15] (seed: False)
2021-01-25T20:14:19+00:00 longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-6 !WARNING | scylla: [shard 9] repair - repair id [id=5, uuid=54f40911-d25d-4ead-afb0-0316d5a0be66] on shard 9, keyspace=cdc_test, cf=test_table, range=(18880742379052849, 40533130447798323], got error in row level repair: std::runtime_error (repair id [id=5, uuid=54f40911-d25d-4ead-afb0-0316d5a0be66] is aborted on shard 9)
void seastar::backtrace(seastar::current_backtrace_tasklocal()::$_3&&) at ./build/release/seastar/./seastar/include/seastar/util/backtrace.hh:59
(inlined by) seastar::current_backtrace_tasklocal() at ./build/release/seastar/./seastar/src/util/backtrace.cc:86
seastar::current_tasktrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:135
seastar::current_backtrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:168
seastar::report_failed_future(std::__exception_ptr::exception_ptr const&) at ./build/release/seastar/./seastar/src/core/future.cc:210
(inlined by) seastar::report_failed_future(seastar::future_state_base::any&&) at ./build/release/seastar/./seastar/src/core/future.cc:218
seastar::future_state_base::any::check_failure() at ././seastar/include/seastar/core/future.hh:567
(inlined by) seastar::future_state >::clear() at ././seastar/include/seastar/core/future.hh:609
(inlined by) ~future_state at ././seastar/include/seastar/core/future.hh:614
(inlined by) ~future at ././seastar/include/seastar/core/future.hh:1337
(inlined by) ~ at ./message/messaging_service.cc:821
(inlined by) ~continuation at ././seastar/include/seastar/core/future.hh:750
(inlined by) seastar::continuation, seastar::rpc::source > >, seastar::future, seastar::rpc::source > > netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future, seastar::rpc::source > > seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>::run_and_dispose() at ././seastar/include/seastar/core/future.hh:771
seastar::reactor::run_tasks(seastar::reactor::task_queue&) at ./build/release/seastar/./seastar/src/core/reactor.cc:2220
(inlined by) seastar::reactor::run_some_tasks() at ./build/release/seastar/./seastar/src/core/reactor.cc:2629
seastar::reactor::run() at ./build/release/seastar/./seastar/src/core/reactor.cc:2788
operator() at ./build/release/seastar/./seastar/src/core/reactor.cc:3979
(inlined by) void std::__invoke_impl (inlined by) std::enable_if::type std::__invoke_r (inlined by) std::_Function_handler::_M_invoke(std::_Any_data const&) at /usr/lib/gcc/x86_64-redhat-linux/10/../../../../include/c++/10/bits/std_function.h:291
std::function (inlined by) seastar::posix_thread::start_routine(void
) at ./build/release/seastar/./seastar/src/core/posix.cc:60
?? ??:0
?? ??:0
2021-01-25 20:14:20.000: (DatabaseLogEvent Severity.ERROR): type=RUNTIME_ERROR regex=std::runtime_error line_number=12446 node=Node longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-6 [13.49.225.189 | 10.0.2.15] (seed: False)
2021-01-25T20:14:20+00:00 longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-6 !WARNING | scylla: [shard 7] repair - repair id [id=5, uuid=54f40911-d25d-4ead-afb0-0316d5a0be66] on shard 7, keyspace=cdc_test, cf=test_table_preimage_postimage, range=(-963135068110387736, -954451574865628589], got error in row level repair: std::runtime_error (repair id [id=5, uuid=54f40911-d25d-4ead-afb0-0316d5a0be66] is aborted on shard 7)
void seastar::backtrace(seastar::current_backtrace_tasklocal()::$_3&&) at ./build/release/seastar/./seastar/include/seastar/util/backtrace.hh:59
(inlined by) seastar::current_backtrace_tasklocal() at ./build/release/seastar/./seastar/src/util/backtrace.cc:86
seastar::current_tasktrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:135
seastar::current_backtrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:168
seastar::report_failed_future(std::__exception_ptr::exception_ptr const&) at ./build/release/seastar/./seastar/src/core/future.cc:210
(inlined by) seastar::report_failed_future(seastar::future_state_base::any&&) at ./build/release/seastar/./seastar/src/core/future.cc:218
seastar::future_state_base::any::check_failure() at ././seastar/include/seastar/core/future.hh:567
(inlined by) seastar::future_state >::clear() at ././seastar/include/seastar/core/future.hh:609
(inlined by) ~future_state at ././seastar/include/seastar/core/future.hh:614
(inlined by) ~future at ././seastar/include/seastar/core/future.hh:1337
(inlined by) ~ at ./message/messaging_service.cc:821
(inlined by) ~continuation at ././seastar/include/seastar/core/future.hh:750
(inlined by) seastar::continuation, seastar::rpc::source > >, seastar::future, seastar::rpc::source > > netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future, seastar::rpc::source > > seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>::run_and_dispose() at ././seastar/include/seastar/core/future.hh:771
seastar::reactor::run_tasks(seastar::reactor::task_queue&) at ./build/release/seastar/./seastar/src/core/reactor.cc:2220
(inlined by) seastar::reactor::run_some_tasks() at ./build/release/seastar/./seastar/src/core/reactor.cc:2629
seastar::reactor::run() at ./build/release/seastar/./seastar/src/core/reactor.cc:2788
operator() at ./build/release/seastar/./seastar/src/core/reactor.cc:3979
(inlined by) void std::__invoke_impl (inlined by) std::enable_if::type std::__invoke_r (inlined by) std::_Function_handler::_M_invoke(std::_Any_data const&) at /usr/lib/gcc/x86_64-redhat-linux/10/../../../../include/c++/10/bits/std_function.h:291
std::function (inlined by) seastar::posix_thread::start_routine(void) at ./build/release/seastar/./seastar/src/core/posix.cc:60
?? ??:0
?? ??:0
2021-01-25 20:14:48.000: (DatabaseLogEvent Severity.ERROR): type=BACKTRACE regex=backtrace line_number=12779 node=Node longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-6 [13.49.225.189 | 10.0.2.15] (seed: False)
2021-01-25T20:14:48+00:00 longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-6 !WARNING | scylla: [shard 2] seastar - Exceptional future ignored: seastar::rpc::closed_error (connection is closed), backtrace: 0x3c532ee#012 0x3c53760#012 0x3c53ae8#012 0x387eb67#012 0x2879d3c#012 0x38c97af#012 0x38ca997#012 0x38e9058#012 0x38950fa#012 /opt/scylladb/libreloc/libpthread.so.0+0x93f8#012 /opt/scylladb/libreloc/libc.so.6+0x101902#012 --------#012 seastar::continuation, seastar::rpc::source > >, netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>
void seastar::backtrace(seastar::current_backtrace_tasklocal()::$_3&&) at ./build/release/seastar/./seastar/include/seastar/util/backtrace.hh:59
(inlined by) seastar::current_backtrace_tasklocal() at ./build/release/seastar/./seastar/src/util/backtrace.cc:86
seastar::current_tasktrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:135
seastar::current_backtrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:168
seastar::report_failed_future(std::__exception_ptr::exception_ptr const&) at ./build/release/seastar/./seastar/src/core/future.cc:210
(inlined by) seastar::report_failed_future(seastar::future_state_base::any&&) at ./build/release/seastar/./seastar/src/core/future.cc:218
seastar::future_state_base::any::check_failure() at ././seastar/include/seastar/core/future.hh:567
(inlined by) seastar::future_state >::clear() at ././seastar/include/seastar/core/future.hh:609
(inlined by) ~future_state at ././seastar/include/seastar/core/future.hh:614
(inlined by) ~future at ././seastar/include/seastar/core/future.hh:1337
(inlined by) ~ at ./message/messaging_service.cc:821
(inlined by) ~continuation at ././seastar/include/seastar/core/future.hh:750
(inlined by) seastar::continuation, seastar::rpc::source > >, seastar::future, seastar::rpc::source > > netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future, seastar::rpc::source > > seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>::run_and_dispose() at ././seastar/include/seastar/core/future.hh:771
seastar::reactor::run_tasks(seastar::reactor::task_queue&) at ./build/release/seastar/./seastar/src/core/reactor.cc:2220
(inlined by) seastar::reactor::run_some_tasks() at ./build/release/seastar/./seastar/src/core/reactor.cc:2629
seastar::reactor::run() at ./build/release/seastar/./seastar/src/core/reactor.cc:2788
operator() at ./build/release/seastar/./seastar/src/core/reactor.cc:3979
(inlined by) void std::__invoke_impl (inlined by) std::enable_if::type std::__invoke_r (inlined by) std::_Function_handler::_M_invoke(std::_Any_data const&) at /usr/lib/gcc/x86_64-redhat-linux/10/../../../../include/c++/10/bits/std_function.h:291
std::function (inlined by) seastar::posix_thread::start_routine(void
) at ./build/release/seastar/./seastar/src/core/posix.cc:60
?? ??:0
?? ??:0
2021-01-25 20:14:48.000: (DatabaseLogEvent Severity.ERROR): type=BACKTRACE regex=backtrace line_number=12780 node=Node longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-6 [13.49.225.189 | 10.0.2.15] (seed: False)
2021-01-25T20:14:48+00:00 longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-6 !WARNING | scylla: [shard 2] seastar - Exceptional future ignored: seastar::rpc::closed_error (connection is closed), backtrace: 0x3c532ee#012 0x3c53760#012 0x3c53ae8#012 0x387eb67#012 0x2879d3c#012 0x38c97af#012 0x38ca997#012 0x38e9058#012 0x38950fa#012 /opt/scylladb/libreloc/libpthread.so.0+0x93f8#012 /opt/scylladb/libreloc/libc.so.6+0x101902#012 --------#012 seastar::continuation, seastar::rpc::source > >, netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>
void seastar::backtrace(seastar::current_backtrace_tasklocal()::$_3&&) at ./build/release/seastar/./seastar/include/seastar/util/backtrace.hh:59
(inlined by) seastar::current_backtrace_tasklocal() at ./build/release/seastar/./seastar/src/util/backtrace.cc:86
seastar::current_tasktrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:135
seastar::current_backtrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:168
seastar::report_failed_future(std::__exception_ptr::exception_ptr const&) at ./build/release/seastar/./seastar/src/core/future.cc:210
(inlined by) seastar::report_failed_future(seastar::future_state_base::any&&) at ./build/release/seastar/./seastar/src/core/future.cc:218
seastar::future_state_base::any::check_failure() at ././seastar/include/seastar/core/future.hh:567
(inlined by) seastar::future_state >::clear() at ././seastar/include/seastar/core/future.hh:609
(inlined by) ~future_state at ././seastar/include/seastar/core/future.hh:614
(inlined by) ~future at ././seastar/include/seastar/core/future.hh:1337
(inlined by) ~ at ./message/messaging_service.cc:821
(inlined by) ~continuation at ././seastar/include/seastar/core/future.hh:750
(inlined by) seastar::continuation, seastar::rpc::source > >, seastar::future, seastar::rpc::source > > netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future, seastar::rpc::source > > seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>::run_and_dispose() at ././seastar/include/seastar/core/future.hh:771
seastar::reactor::run_tasks(seastar::reactor::task_queue&) at ./build/release/seastar/./seastar/src/core/reactor.cc:2220
(inlined by) seastar::reactor::run_some_tasks() at ./build/release/seastar/./seastar/src/core/reactor.cc:2629
seastar::reactor::run() at ./build/release/seastar/./seastar/src/core/reactor.cc:2788
operator() at ./build/release/seastar/./seastar/src/core/reactor.cc:3979
(inlined by) void std::__invoke_impl (inlined by) std::enable_if::type std::__invoke_r (inlined by) std::_Function_handler::_M_invoke(std::_Any_data const&) at /usr/lib/gcc/x86_64-redhat-linux/10/../../../../include/c++/10/bits/std_function.h:291
std::function (inlined by) seastar::posix_thread::start_routine(void) at ./build/release/seastar/./seastar/src/core/posix.cc:60
?? ??:0
?? ??:0
2021-01-25 20:14:37.000: (DatabaseLogEvent Severity.ERROR): type=RUNTIME_ERROR regex=std::runtime_error line_number=18382 node=Node longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-5 [13.51.64.58 | 10.0.0.86] (seed: False)
2021-01-25T20:14:37+00:00 longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-5 !WARNING | scylla: [shard 3] repair - repair id [id=4, uuid=a0bd8824-5f51-4ec1-bf4e-19b6f4dd4afa] on shard 3, keyspace=cdc_test, cf=test_table_postimage, range=(219550252143665664, 236562153728247702], got error in row level repair: std::runtime_error (repair id [id=4, uuid=a0bd8824-5f51-4ec1-bf4e-19b6f4dd4afa] is aborted on shard 3)
void seastar::backtrace(seastar::current_backtrace_tasklocal()::$_3&&) at ./build/release/seastar/./seastar/include/seastar/util/backtrace.hh:59
(inlined by) seastar::current_backtrace_tasklocal() at ./build/release/seastar/./seastar/src/util/backtrace.cc:86
seastar::current_tasktrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:135
seastar::current_backtrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:168
seastar::report_failed_future(std::__exception_ptr::exception_ptr const&) at ./build/release/seastar/./seastar/src/core/future.cc:210
(inlined by) seastar::report_failed_future(seastar::future_state_base::any&&) at ./build/release/seastar/./seastar/src/core/future.cc:218
seastar::future_state_base::any::check_failure() at ././seastar/include/seastar/core/future.hh:567
(inlined by) seastar::future_state >::clear() at ././seastar/include/seastar/core/future.hh:609
(inlined by) ~future_state at ././seastar/include/seastar/core/future.hh:614
(inlined by) ~future at ././seastar/include/seastar/core/future.hh:1337
(inlined by) ~ at ./message/messaging_service.cc:821
(inlined by) ~continuation at ././seastar/include/seastar/core/future.hh:750
(inlined by) seastar::continuation, seastar::rpc::source > >, seastar::future, seastar::rpc::source > > netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future, seastar::rpc::source > > seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>::run_and_dispose() at ././seastar/include/seastar/core/future.hh:771
seastar::reactor::run_tasks(seastar::reactor::task_queue&) at ./build/release/seastar/./seastar/src/core/reactor.cc:2220
(inlined by) seastar::reactor::run_some_tasks() at ./build/release/seastar/./seastar/src/core/reactor.cc:2629
seastar::reactor::run() at ./build/release/seastar/./seastar/src/core/reactor.cc:2788
operator() at ./build/release/seastar/./seastar/src/core/reactor.cc:3979
(inlined by) void std::__invoke_impl (inlined by) std::enable_if::type std::__invoke_r (inlined by) std::_Function_handler::_M_invoke(std::_Any_data const&) at /usr/lib/gcc/x86_64-redhat-linux/10/../../../../include/c++/10/bits/std_function.h:291
std::function (inlined by) seastar::posix_thread::start_routine(void
) at ./build/release/seastar/./seastar/src/core/posix.cc:60
?? ??:0
?? ??:0
2021-01-25 20:14:42.000: (DatabaseLogEvent Severity.ERROR): type=RUNTIME_ERROR regex=std::runtime_error line_number=18432 node=Node longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-5 [13.51.64.58 | 10.0.0.86] (seed: False)
2021-01-25T20:14:42+00:00 longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-5 !WARNING | scylla: [shard 11] repair - repair id [id=4, uuid=a0bd8824-5f51-4ec1-bf4e-19b6f4dd4afa] on shard 11, keyspace=cdc_test, cf=test_table_preimage_postimage_scylla_cdc_log, range=(-2842404103168958694, -2825574554437859504], got error in row level repair: std::runtime_error (put_row_diff: Repair follower=10.0.3.211 failed in put_row_diff hanlder, status=0)
void seastar::backtrace(seastar::current_backtrace_tasklocal()::$_3&&) at ./build/release/seastar/./seastar/include/seastar/util/backtrace.hh:59
(inlined by) seastar::current_backtrace_tasklocal() at ./build/release/seastar/./seastar/src/util/backtrace.cc:86
seastar::current_tasktrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:135
seastar::current_backtrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:168
seastar::report_failed_future(std::__exception_ptr::exception_ptr const&) at ./build/release/seastar/./seastar/src/core/future.cc:210
(inlined by) seastar::report_failed_future(seastar::future_state_base::any&&) at ./build/release/seastar/./seastar/src/core/future.cc:218
seastar::future_state_base::any::check_failure() at ././seastar/include/seastar/core/future.hh:567
(inlined by) seastar::future_state >::clear() at ././seastar/include/seastar/core/future.hh:609
(inlined by) ~future_state at ././seastar/include/seastar/core/future.hh:614
(inlined by) ~future at ././seastar/include/seastar/core/future.hh:1337
(inlined by) ~ at ./message/messaging_service.cc:821
(inlined by) ~continuation at ././seastar/include/seastar/core/future.hh:750
(inlined by) seastar::continuation, seastar::rpc::source > >, seastar::future, seastar::rpc::source > > netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future, seastar::rpc::source > > seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>::run_and_dispose() at ././seastar/include/seastar/core/future.hh:771
seastar::reactor::run_tasks(seastar::reactor::task_queue&) at ./build/release/seastar/./seastar/src/core/reactor.cc:2220
(inlined by) seastar::reactor::run_some_tasks() at ./build/release/seastar/./seastar/src/core/reactor.cc:2629
seastar::reactor::run() at ./build/release/seastar/./seastar/src/core/reactor.cc:2788
operator() at ./build/release/seastar/./seastar/src/core/reactor.cc:3979
(inlined by) void std::__invoke_impl (inlined by) std::enable_if::type std::__invoke_r (inlined by) std::_Function_handler::_M_invoke(std::_Any_data const&) at /usr/lib/gcc/x86_64-redhat-linux/10/../../../../include/c++/10/bits/std_function.h:291
std::function (inlined by) seastar::posix_thread::start_routine(void*) at ./build/release/seastar/./seastar/src/core/posix.cc:60
?? ??:0
?? ??:0

2021-01-25 20:15:01.000: (DatabaseLogEvent Severity.ERROR): type=RUNTIME_ERROR regex=std::runtime_error line_number=18630 node=Node longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-5 [13.51.64.58 | 10.0.0.86] (seed: False)
2021-01-25T20:15:01+00:00 longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-5 !WARNING | scylla: [shard 12] repair - repair id [id=4, uuid=a0bd8824-5f51-4ec1-bf4e-19b6f4dd4afa] on shard 12 failed: std::runtime_error (repair id [id=4, uuid=a0bd8824-5f51-4ec1-bf4e-19b6f4dd4afa] is aborted on shard 12)
void seastar::backtrace(seastar::current_backtrace_tasklocal()::$_3&&) at ./build/release/seastar/./seastar/include/seastar/util/backtrace.hh:59
(inlined by) seastar::current_backtrace_tasklocal() at ./build/release/seastar/./seastar/src/util/backtrace.cc:86
seastar::current_tasktrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:135
seastar::current_backtrace() at ./build/release/seastar/./seastar/src/util/backtrace.cc:168
seastar::report_failed_future(std::__exception_ptr::exception_ptr const&) at ./build/release/seastar/./seastar/src/core/future.cc:210
(inlined by) seastar::report_failed_future(seastar::future_state_base::any&&) at ./build/release/seastar/./seastar/src/core/future.cc:218
seastar::future_state_base::any::check_failure() at ././seastar/include/seastar/core/future.hh:567
(inlined by) seastar::future_state >::clear() at ././seastar/include/seastar/core/future.hh:609
(inlined by) ~future_state at ././seastar/include/seastar/core/future.hh:614
(inlined by) ~future at ././seastar/include/seastar/core/future.hh:1337
(inlined by) ~ at ./message/messaging_service.cc:821
(inlined by) ~continuation at ././seastar/include/seastar/core/future.hh:750
(inlined by) seastar::continuation, seastar::rpc::source > >, seastar::future, seastar::rpc::source > > netw::do_make_sink_source(netw::messaging_verb, unsigned int, seastar::shared_ptr, std::unique_ptr >&)::{lambda(seastar::rpc::sink)#1}::operator()(seastar::rpc::sink)::{lambda(seastar::future >)#1}::operator()({lambda(seastar::rpc::sink)#1})::{lambda()#1}, seastar::future, seastar::rpc::source > > seastar::future::then_impl_nrvo<{lambda(seastar::future >)#1}, seastar::future, seastar::rpc::source > > >({lambda(seastar::future >)#1}&&)::{lambda(seastar::internal::promise_base_with_type, seastar::rpc::source > >&&, {lambda(seastar::future >)#1}&, seastar::future_state&&)#1}, void>::run_and_dispose() at ././seastar/include/seastar/core/future.hh:771
seastar::reactor::run_tasks(seastar::reactor::task_queue&) at ./build/release/seastar/./seastar/src/core/reactor.cc:2220
(inlined by) seastar::reactor::run_some_tasks() at ./build/release/seastar/./seastar/src/core/reactor.cc:2629
seastar::reactor::run() at ./build/release/seastar/./seastar/src/core/reactor.cc:2788
operator() at ./build/release/seastar/./seastar/src/core/reactor.cc:3979
(inlined by) void std::__invoke_impl (inlined by) std::enable_if::type std::__invoke_r (inlined by) std::_Function_handler::_M_invoke(std::_Any_data const&) at /usr/lib/gcc/x86_64-redhat-linux/10/../../../../include/c++/10/bits/std_function.h:291
std::function (inlined by) seastar::posix_thread::start_routine(void*) at ./build/release/seastar/./seastar/src/core/posix.cc:60
?? ??:0
?? ??:0
```

====================================

Restore Monitor Stack command: $ hydra investigate show-monitor a92a84b6-e2e9-4d07-8aa6-a85bf96363f8
Show all stored logs command: $ hydra investigate show-logs a92a84b6-e2e9-4d07-8aa6-a85bf96363f8

Logs:
grafana - https://cloudius-jenkins-test.s3.amazonaws.com/a92a84b6-e2e9-4d07-8aa6-a85bf96363f8/20210125_230539/grafana-screenshot-longevity-cdc-4h-test-scylla-per-server-metrics-nemesis-20210125_230927-longevity-cdc-100gb-4h-4-4-monitor-node-a92a84b6-1.png
grafana - https://cloudius-jenkins-test.s3.amazonaws.com/a92a84b6-e2e9-4d07-8aa6-a85bf96363f8/20210125_230539/grafana-screenshot-overview-20210125_230539-longevity-cdc-100gb-4h-4-4-monitor-node-a92a84b6-1.png
grafana - https://cloudius-jenkins-test.s3.amazonaws.com/a92a84b6-e2e9-4d07-8aa6-a85bf96363f8/20210125_231434/grafana-screenshot-longevity-cdc-4h-test-scylla-per-server-metrics-nemesis-20210125_231748-longevity-cdc-100gb-4h-4-4-monitor-node-a92a84b6-1.png
grafana - https://cloudius-jenkins-test.s3.amazonaws.com/a92a84b6-e2e9-4d07-8aa6-a85bf96363f8/20210125_231434/grafana-screenshot-overview-20210125_231434-longevity-cdc-100gb-4h-4-4-monitor-node-a92a84b6-1.png
db-cluster - https://cloudius-jenkins-test.s3.amazonaws.com/a92a84b6-e2e9-4d07-8aa6-a85bf96363f8/20210125_232251/db-cluster-a92a84b6.zip
loader-set - https://cloudius-jenkins-test.s3.amazonaws.com/a92a84b6-e2e9-4d07-8aa6-a85bf96363f8/20210125_232251/loader-set-a92a84b6.zip
monitor-set - https://cloudius-jenkins-test.s3.amazonaws.com/a92a84b6-e2e9-4d07-8aa6-a85bf96363f8/20210125_232251/monitor-set-a92a84b6.zip
sct-runner - https://cloudius-jenkins-test.s3.amazonaws.com/a92a84b6-e2e9-4d07-8aa6-a85bf96363f8/20210125_232251/sct-runner-a92a84b6.zip

bug

All 23 comments

When one of the node is down, repair is supposed to fail because one of the node is down. The repair continues as best effort mode, meaning repair with whatever node is live. The repair will report the repair is failed in the end.

The removenode ops failed because other nodes marked the coordinator node to run the removenode as DOWN during the removenode ops.

14967 2021-01-25T20:08:12+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-1 !WARNING | scylla: [shard 13] repair - repair id [id=2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7] on shard 13 failed - 109 ranges failed
14968 2021-01-25T20:08:12+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-1 !WARNING | scylla: [shard 9] repair - Repair 504 out of 517 ranges, id=[id=2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7], shard=9, keyspace=system_trace      s, table={events, sessions, node_slow_log, sessions_time_idx, node_slow_log_time_idx}, range=(8540746862393288986, 8549954687201168966], peers={10.0.1.116}, live_peers={}, status=skipped
14969 2021-01-25T20:08:12+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-1 !WARNING | scylla: [shard 9] repair - Repair 505 out of 517 ranges, id=[id=2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7], shard=9, keyspace=system_trace      s, table={events, sessions, node_slow_log, sessions_time_idx, node_slow_log_time_idx}, range=(8682823651964586273, 8693545045829343754], peers={10.0.1.116}, live_peers={}, status=skipped
14970 2021-01-25T20:08:12+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-1 !WARNING | scylla: [shard 13] repair - repair id [id=2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7] on shard 13 failed: std::runtime_error (repair id [id      =2, uuid=18943d16-c86b-4f45-b1fa-9d01922ec5d7] on shard 13 failed to repair 109 sub ranges)
$ cat longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-*/messages.log|grep 10.0.0.196|grep DOWN
2021-01-25T20:14:03+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-2 !INFO    | scylla: [shard 0] gossip - InetAddress 10.0.0.196 is now DOWN, status = NORMAL
2021-01-25T20:15:20+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-2 !INFO    | scylla: [shard 0] gossip - InetAddress 10.0.0.196 is now DOWN, status = NORMAL
2021-01-25T20:16:07+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-2 !INFO    | scylla: [shard 0] gossip - InetAddress 10.0.0.196 is now DOWN, status = NORMAL
2021-01-25T20:17:19+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-2 !INFO    | scylla: [shard 0] gossip - InetAddress 10.0.0.196 is now DOWN, status = NORMAL
2021-01-25T20:18:57+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-2 !INFO    | scylla: [shard 0] gossip - InetAddress 10.0.0.196 is now DOWN, status = NORMAL
2021-01-25T20:19:58+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-2 !INFO    | scylla: [shard 0] gossip - InetAddress 10.0.0.196 is now DOWN, status = NORMAL
2021-01-25T20:15:02+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-4 !INFO    | scylla: [shard 0] gossip - InetAddress 10.0.0.196 is now DOWN, status = NORMAL
2021-01-25T20:17:17+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-4 !INFO    | scylla: [shard 0] gossip - InetAddress 10.0.0.196 is now DOWN, status = NORMAL
2021-01-25T20:18:57+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-4 !INFO    | scylla: [shard 0] gossip - InetAddress 10.0.0.196 is now DOWN, status = NORMAL
2021-01-25T20:20:03+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-4 !INFO    | scylla: [shard 0] gossip - InetAddress 10.0.0.196 is now DOWN, status = NORMAL
2021-01-25T20:14:48+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-5 !INFO    | scylla: [shard 0] gossip - InetAddress 10.0.0.196 is now DOWN, status = NORMAL
2021-01-25T20:17:57+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-5 !INFO    | scylla: [shard 0] gossip - InetAddress 10.0.0.196 is now DOWN, status = NORMAL
2021-01-25T20:18:54+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-5 !INFO    | scylla: [shard 0] gossip - InetAddress 10.0.0.196 is now DOWN, status = NORMAL
2021-01-25T20:14:59+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-6 !INFO    | scylla: [shard 0] gossip - InetAddress 10.0.0.196 is now DOWN, status = NORMAL
2021-01-25T20:16:36+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-6 !INFO    | scylla: [shard 0] gossip - InetAddress 10.0.0.196 is now DOWN, status = NORMAL
2021-01-25T20:17:23+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-6 !INFO    | scylla: [shard 0] gossip - InetAddress 10.0.0.196 is now DOWN, status = NORMAL
2021-01-25T20:18:59+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-6 !INFO    | scylla: [shard 0] gossip - InetAddress 10.0.0.196 is now DOWN, status = NORMAL
2021-01-25T20:20:02+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-6 !INFO    | scylla: [shard 0] gossip - InetAddress 10.0.0.196 is now DOWN, status = NORMAL

The metrics looks horrible.

image

The removenode ops failed because other nodes marked the coordinator node to run the removenode as DOWN during the removenode ops.

@asias why did that happen? what is the root cause?

@aleksbykov hydra investigate show-monitor failed for me.

$ hydra investigate show-monitor a92a84b6-e2e9-4d07-8aa6-a85bf96363f8
Docker version 20.10.2, build 2291f61
There is scylladb/hydra:v0.85 in local cache, use it.
Obtaining QA SSH keys...
QA SSH keys obtained.
Making sure the ownerships of results directories are of the user
Going to run './sct.py investigate show-monitor a92a84b6-e2e9-4d07-8aa6-a85bf96363f8'...

We trust you have received the usual lecture from the local System
Administrator. It usually boils down to these three things:

    #1) Respect the privacy of others.
    #2) Think before you type.
    #3) With great power comes great responsibility.

[sudo] password for asias: 
Obtaining QA SSH keys...
QA SSH keys obtained.
New directory created: /home/asias/sct-results/20210202-020900-603914-investigate-show-monitor
Search monitoring stack archive files for test id a92a84b6-e2e9-4d07-8aa6-a85bf96363f8 and restoring...
Checking that docker is available...
Docker is available
Restoring monitoring stack from archive a92a84b6-e2e9-4d07-8aa6-a85bf96363f8/20210125_232251/monitor-set-a92a84b6.zip
Download file https://cloudius-jenkins-test.s3.amazonaws.com/a92a84b6-e2e9-4d07-8aa6-a85bf96363f8/20210125_232251/monitor-set-a92a84b6.zip to directory /tmp/tmpb8oxh6gp
Downloading a92a84b6-e2e9-4d07-8aa6-a85bf96363f8/20210125_232251/monitor-set-a92a84b6.zip from cloudius-jenkins-test
Downloaded finished
Error executing command: "cd /tmp/tmpb8oxh6gp/scylla-monitoring-src;
            ./start-all.sh             -g 33201 -m 32935 -p 51663             -s /tmp/tmpb8oxh6gp/scylla-monitoring-src/config/scylla_servers.yml             -n /tmp/tmpb8oxh6gp/scylla-monitoring-src/config/node_exporter_servers.yml             -d /tmp/tmpb8oxh6gp/monitoring_data_dir/20210125T232322Z-78629a0f5f3f164f -v 4.4"; Exit status: 1
Start docker containers [try #1]

The removenode ops failed because other nodes marked the coordinator node to run the removenode as DOWN during the removenode ops.

@asias why did that happen? what is the root cause?

The system is heavily loaded. I need the monitor data to dig more. It is probably related to CDC because we do not see such huge drops without CDC. HH can also contribute extra workload like we saw: https://github.com/scylladb/scylla/issues/7976.

I suggest aleksbykov to give me a sct branch and a run.sh script inside, so I can modify and run the test myself. I want to limit scope of the tests. I also want to understand the exact workload used.

Test1:
1) Run c-s with cdc workload
2) Kill one node
3) Run repair

Test 2:
1) Run c-s with cdc workload
2) Kill one node
3) Run removenode ops

@asias i restored the monitoring stack on aws instance: http://13.51.48.255:3000/d/gFYbtaYGz/my_gemini-scylla-per-server-metrics-nemesis-master?orgId=1&from=1611601085679&to=1611618200744
(If you will not see results, filter for last 30 days)

Thanks. I can see the monitor now.

@aleksbykov

The test yaml says:

stress_cmd: [ "cassandra-stress user no-warmup profile=/tmp/cdc_profile.yaml ops'(insert=2,read1=1,update_number=1,update_name=1,delete1=1)' cl=QUORUM duration=200m -port jmx=6868 -mode cql3 native -rate threads=100",
              "cassandra-stress user no-warmup profile=/tmp/cdc_profile_preimage.yaml ops'(insert=2,read1=1,update_number=1,update_name=1,delete1=1)' cl=QUORUM duration=200m -port jmx=6868 -mode cql3 native -rate threads=100",
              "cassandra-stress user no-warmup profile=/tmp/cdc_profile_postimage.yaml ops'(insert=2,read1=1,update_number=1,update_name=1,delete1=1)' cl=QUORUM duration=200m -port jmx=6868 -mode cql3 native -rate threads=100",
              "cassandra-stress user no-warmup profile=/tmp/cdc_profile_preimage_postimage.yaml ops'(insert=2,read1=1,update_number=1,update_name=1,delete1=1)' cl=QUORUM duration=200m -port jmx=6868 -mode cql3 native -rate threads=100"
             ]

n_db_nodes: 6
n_loaders: 2
n_monitor_nodes: 1

instance_type_db: 'i3.4xlarge'

There was only 2 loaders. Will each loader run all the 4 c-s cmds at the same time (2*4 c-s in parallel)? Also, I see not QPS limit in the c-s cmd, so the loaders will run at full speed to load the cluster to the extreme?

stress_cmd: [ "cassandra-stress user no-warmup profile=/tmp/cdc_profile.yaml ops'(insert=2,read1=1,update_number=1,update_name=1,delete1=1)' cl=QUORUM duration=200m -port jmx=6868 -mode cql3 native -rate threads=100",
              "cassandra-stress user no-warmup profile=/tmp/cdc_profile_preimage.yaml ops'(insert=2,read1=1,update_number=1,update_name=1,delete1=1)' cl=QUORUM duration=200m -port jmx=6868 -mode cql3 native -rate threads=100",
              "cassandra-stress user no-warmup profile=/tmp/cdc_profile_postimage.yaml ops'(insert=2,read1=1,update_number=1,update_name=1,delete1=1)' cl=QUORUM duration=200m -port jmx=6868 -mode cql3 native -rate threads=100",
              "cassandra-stress user no-warmup profile=/tmp/cdc_profile_preimage_postimage.yaml ops'(insert=2,read1=1,update_number=1,update_name=1,delete1=1)' cl=QUORUM duration=200m -port jmx=6868 -mode cql3 native -rate threads=100"
             ]

node 10.0.1.116 is being shut down

 2021-01-25T20:06:55+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-1 !INFO    | scylla: [shard 0] gossip - InetAddress 10.0.1.116 is now DOWN, status = shutdown

At time 20:09:08, node1 10.0.0.196 was the coordinator to run the removnoede cmd, and the repair for remoovenode ops started

 2021-01-25T20:09:08+00:00  longevity-cdc-100gb-4h-4-4-db-node-a92a84b6-1 !INFO    | scylla: [shard 0] storage_service - removenode[680d8ebf-f2bd-4e15-9a4b-f670782c86ad]: Started removenode operation, removing node=10.0.1.116,       sync_nodes={10.0.2.15, 10.0.1.199, 10.0.0.86, 10.0.3.211, 10.0.0.196}, ignore_nodes=[]

A spike of sstable reads and streaming io was observed.

image

image

No spike of hints during the problematic period around 20:09

image

Spike of cache misses:

image

Despite all those spikes, I do not understand why the CQL dropped to floor. @slivne @avikivity Do you have any insights? The monitor is still alive.

The minimum reproducer would be:

start 6 nodes, run c-s above, kill one node, run removenode

BTW, I think it is better to rate limit c-s to generate a constant workload on the loader with the throttle option, like

cassandra-stress user no-warmup profile=data_dir/cdc_profile_preimage.yaml ops'(insert=2,read1=1,update_number=1,update_name=1,delete1=1)' cl=QUORUM duration=200m -port jmx=6868 -mode cql3 native -rate 'threads=100 throttle=5000/s'

Tune the throttle option according to the cluster capacity, e.g., to load the system 80%.

Otherwise, c-s is sending requests as fast as it can.

@slivne ping

The issue reproduced with Scylla version 4.5.dev-0.20210206.e1261d10f with build-id 6c45f8a236b89fa9dbdcd135bb26dccf4ffaf154 (ami-0e75b72c70e2d89a5(eu-north-1) during job: https://jenkins.scylladb.com/view/master/job/scylla-master/job/longevity/job/longevity-100gb-4h-test/300/

Again during the nemesis TerminateAndRemove after scylla was stopped on node8 and instance was terminated, Repair and nodetool remove were failed.

db logs: https://cloudius-jenkins-test.s3.amazonaws.com/db26ca4d-328c-4797-8b17-8910694acd11/20210207_050405/db-cluster-db26ca4d.zip

@asias I added script which allow to start the repo:
https://github.com/aleksbykov/scylla-cluster-tests/tree/repo_7965
file: repo_7965.sh

The issue reproduced with Scylla version 4.5.dev-0.20210206.e1261d10f with build-id 6c45f8a236b89fa9dbdcd135bb26dccf4ffaf154 (ami-0e75b72c70e2d89a5(eu-north-1) during job: https://jenkins.scylladb.com/view/master/job/scylla-master/job/longevity/job/longevity-100gb-4h-test/300/

Again during the nemesis TerminateAndRemove after scylla was stopped on node8 and instance was terminated, Repair and nodetool remove were failed.

db logs: https://cloudius-jenkins-test.s3.amazonaws.com/db26ca4d-328c-4797-8b17-8910694acd11/20210207_050405/db-cluster-db26ca4d.zip

@asias I added script which allow to start the repo:
https://github.com/aleksbykov/scylla-cluster-tests/tree/repo_7965
file: repo_7965.sh

Thanks.

Refs: https://github.com/scylladb/scylla/issues/8030 CDC uses TWCS strategy.
I think it is a combination of repair triggered on all nodes by removenode ops + TWCS triggered by CDC workload + unlimited cql workload generated by c-s (no throttle parameter) overwhelmed the node. Then the node was marked as down and failed the removenode ops.

@asias tried to have a look and I am lost

(assuming this is without repair node operation and repair is done prior to removenode - like users are asked todo)

  • One node was stopped a nodetool repair was run - that repair failed - yet we need it to succeed (without repair based operations) - how can we make it succeed by ignoring the failed node? I think we have been chasing this before as well - and I am not sure I got an answer how to ignore the failed node during repair. @asias - how can we do that ?

  • The repair is indeed running in the background (at least based on the sum(irate(scylla_repair_rx_row_nr[30s])) - ontop of that - we are able todo a removenode - really strange the ranges to repair and the token range ownerhsip are changing for repair under its feet and it continues to run ?

repair_while_remove_node

We really need to find a way to run repair without the node that is going to be removed.

Next trying to see whats going on during the part we do not have any CQL traffic.

@bhalevy / @raphaelsc

zoom_in

io1

io2

io3

io4

task_group_vs_bytes

So its seems compaction is taking full control of our cpus but the strange thing is that it does not align with disk_io - thats strange - how come compaction is using ALL CPU but is not doing almost ANY IO ?

compactions

and we do have compactions running like crazy - but still - where is the IO associated with compaction ?

Based on the task quota violations we may have large stalls embedded inside - we need to set the stall detector to lower values to find them (50ms is expected to find something)

Cc @raphaelsc

* One node was stopped a nodetool repair was run - that repair failed - yet we need it to succeed (without repair based operations) - how can we make it succeed by ignoring the failed node? I think we have been chasing this before as well - and I am not sure I got an answer how to ignore the failed node during repair. @asias - how can we do that ?

See here: https://github.com/scylladb/scylla/issues/7806#issuecomment-751922355.

I suggested adding an option to ignore dead nodes explicitly. I got no response yet.

* The repair is indeed running in the background (at least based on the `sum(irate(scylla_repair_rx_row_nr[30s]))` - ontop of that - we are able todo a removenode - really strange the ranges to repair and the token range ownerhsip are changing for repair under its feet and it continues to run ?

The repair is triggered by removenode ops, not the regular repair triggered by user.

After

commit 829b4c14380020fa4058335c4c43cefec135b3de
Author: Asias He <[email protected]>
Date:   Mon Nov 16 16:13:44 2020 +0800

    repair: Make removenode safe by default

We always use repair based node ops for removenode operations.

we do have compactions running like crazy - but still - where is the IO associated with compaction ?

@raphaelsc suspects an issue with backlog tracker.
See #6054

If compaction aggressiveness doesn't match the actual amount of compaction backlog (i.e. compaction work left), then we very likely have a problem in the controller. Given that this issue involves CDC, which in turn uses TWCS, it may potentially be #6054. The patch which fixed it was merged into next. Once it reaches master, we can try another run to see if this issue was fixed. /CC @asias @aleksbykov

Offending commit (829b4c1) reverted on 4.4 only.

Can be tested on 4.4.rc3

Was this page helpful?
0 / 5 - 0 ratings

Related issues

avikivity picture avikivity  路  4Comments

tzach picture tzach  路  7Comments

amoskong picture amoskong  路  6Comments

duarten picture duarten  路  5Comments

gnumoreno picture gnumoreno  路  5Comments