We are running Vault 1.4.2 with role and cred creation on AWS IAM. We have been encountering a “failed to commit WAL entry” error for the following reason.
Delete 127.0.0.1:8500/v1/kv/vault/logical/a3842a67-8dc7-0cba-044e-1e020cc22f8a/wal/7c402cea-73d7-21c4-6bcc-6c9c7650626c: context canceled
We have found the AWS role and cred are created successfully when we refer to the cloudtrail logs.
Our backing store is Consul 1.7.3 and its logs do not show any problems around the same time frame.
2021-10-05T18:07:43.676Z [INFO] agent.server.fsm: snapshot created: duration=31.71µs
agent.server.fsm: snapshot created: duration=31.71µs
2021-10-05T18:07:43.676Z [INFO] agent.server.raft: starting snapshot up to: index=1388953561
2021-10-05T18:07:43.676Z [INFO] snapshot: creating new snapshot: path=/mnt/consul/raft/snapshots/30455-1388953561-1633457263676.tmp
agent.server.raft: starting snapshot up to: index=1388953561
snapshot: creating new snapshot: path=/mnt/consul/raft/snapshots/30455-1388953561-1633457263676.tmp
2021-10-05T18:08:01.125Z [WARN] snapshot: found temporary snapshot: name=30335-253014623-1595344187238.tmp
snapshot: found temporary snapshot: name=30335-253014623-1595344187238.tmp
2021-10-05T18:08:01.125Z [INFO] snapshot: reaping snapshot: path=/mnt/consul/raft/snapshots/30455-1388917430-1633456865863
snapshot: reaping snapshot: path=/mnt/consul/raft/snapshots/30455-1388917430-1633456865863
agent.server.raft: compacting logs: from=1388925941 to=1388945178
2021-10-05T18:08:01.527Z [INFO] agent.server.raft: compacting logs: from=1388925941 to=1388945178
2021-10-05T18:08:01.546Z [INFO] agent.server.raft: snapshot complete up to: index=1388953561
agent.server.raft: snapshot complete up to: index=1388953561
When looking into the commit failed error I see the following message. What I do not understand is what does it really mean and how can we fix it.
// Remove the WAL entry, we succeeded! If we fail, we don't return
// the secret because it'll get rolled back anyways, so we have to return
// an error here.
if err := framework.DeleteWAL(ctx, s, walID); err != nil {
return nil, fmt.Errorf("failed to commit WAL entry: %w", err)
}
Does anyone have any guidance into where I should be looking next?