We are running Vault 1.4.2 with role and cred creation on AWS IAM. We have been encountering a “failed to commit WAL entry” error for the following reason.

Delete 127.0.0.1:8500/v1/kv/vault/logical/a3842a67-8dc7-0cba-044e-1e020cc22f8a/wal/7c402cea-73d7-21c4-6bcc-6c9c7650626c: context canceled

We have found the AWS role and cred are created successfully when we refer to the cloudtrail logs.

Our backing store is Consul 1.7.3 and its logs do not show any problems around the same time frame.

2021-10-05T18:07:43.676Z [INFO]  agent.server.fsm: snapshot created: duration=31.71µs
 agent.server.fsm: snapshot created: duration=31.71µs
2021-10-05T18:07:43.676Z [INFO]  agent.server.raft: starting snapshot up to: index=1388953561
2021-10-05T18:07:43.676Z [INFO]  snapshot: creating new snapshot: path=/mnt/consul/raft/snapshots/30455-1388953561-1633457263676.tmp
 agent.server.raft: starting snapshot up to: index=1388953561
 snapshot: creating new snapshot: path=/mnt/consul/raft/snapshots/30455-1388953561-1633457263676.tmp
2021-10-05T18:08:01.125Z [WARN]  snapshot: found temporary snapshot: name=30335-253014623-1595344187238.tmp
 snapshot: found temporary snapshot: name=30335-253014623-1595344187238.tmp
2021-10-05T18:08:01.125Z [INFO]  snapshot: reaping snapshot: path=/mnt/consul/raft/snapshots/30455-1388917430-1633456865863
 snapshot: reaping snapshot: path=/mnt/consul/raft/snapshots/30455-1388917430-1633456865863
 agent.server.raft: compacting logs: from=1388925941 to=1388945178
2021-10-05T18:08:01.527Z [INFO]  agent.server.raft: compacting logs: from=1388925941 to=1388945178
2021-10-05T18:08:01.546Z [INFO]  agent.server.raft: snapshot complete up to: index=1388953561
 agent.server.raft: snapshot complete up to: index=1388953561

When looking into the commit failed error I see the following message. What I do not understand is what does it really mean and how can we fix it.

// Remove the WAL entry, we succeeded! If we fail, we don't return  
	// the secret because it'll get rolled back anyways, so we have to return  
	// an error here.  
	if err := framework.DeleteWAL(ctx, s, walID); err != nil {  
		return nil, fmt.Errorf("failed to commit WAL entry: %w", err)  
	}  

Does anyone have any guidance into where I should be looking next?

Source link