Faas: Architecture: structured logging

Created on 10 Mar 2018 · 13Comments · Source: openfaas/faas

After reviewing the codebase with @stefanprodan we want to look into options for structured logging for OpenFaaS. Structured logging writes logs in a specific format that can be parsed easily via fluentd or some log aggregator.

Options mentioned:

glog format
JSON format
KVP format

Glog has some questionable design decisions and mixed opinion within the Go community.

Logrus used by Docker has issues with the vendoring disaster by Sirupsen and causes no end of headaches due to renaming the case.

Structured logging may allow us to minimize some of the duplication of logged events - i.e.

Ingress/Nginx logs access to a function
The gateway may also log this
The provider may log this
The function container may log this

We could have 4x the data - and if the execution was 200 OK, it's probably pointless collecting the data in 4x log files. In the cloud the effect of 4x logs is expensive. Stefan explained that logging to stderr counts as a critical error on GKE and costs more where as logging to stdout is cheaper. Glog supports redirecting the logs to stdout - Go's log package which we use does not.

We already aggregate Prometheus "RED" metrics on function calls, so this may be an adequate replacement for writing logs for success messages.

areapi arewatchdog desigreview sizxl skiladvanced

Source

alexellis

👍1

All 13 comments

What would this look like in the code? The advantage is that we have 4 times less log duplication, which makes sense. But instead of the gateway+provider collecting the logs, what service is collecting them? Is another container running in the deployment which does all the logging?

ericstoekl on 20 Mar 2018

On Kubernetes you would send the logs to stderr and use a format that Fluentd understands like json or glog. Fluentd will forward the logs to the default could provider storage (Stackdriver, Cloudwatch, etc) or to Elasticsearch if you run on-prem. There is no need for us to ship a log collector on Kubernetes. Docker Swarm comes with several log drivers that will do the same.

stefanprodan on 20 Mar 2018

I've tested several log packages and I would go with zerolog and json format.

Here is an example https://github.com/stefanprodan/k8s-podinfo/blob/master/cmd/podinfo/main.go

stefanprodan on 20 Mar 2018

@ericstoekl I think there are two unrelated items I've represented in the issue.

Reducing log verbosity

Less cost running at high load in the cloud

Using structured (machine readable) logs

Logs can be indexed/searched in a repeatable way.

Errors for instance can be picked up and used to trigger alerts.

@stefanprodan are those log numbers your own stats? If not then maybe you can link to where you got the table?

What does the podinfo output look like on the console?

alexellis on 2 Apr 2018

Just chiming in with a datapoint:

For our use case (on Kubernetes), simply wrapping the function-emitted log message in JSON and attaching a loglevel has been sufficient; it's all that's necessary to integrate into our existing log pipelines. It isn't amazingly flexible, but it gets the job done.

For example, if someone's using the http executor and writes to STDERR, they get {"level": "ERROR", "message:" ... } and STDOUT gets {"level": "INFO", "message": ...}.

For the case of the streaming executor, where all feedback needs to come across STDERR, I'd toyed around with the idea of expanding the number of available log levels by allowing functions to drop hints to watchdog in the emitted message, but I don't know if that'd do anything other than cause confusion without some sort of function-SDK-level enforcement of message format.

In general, though, we don't really care about logging anything other than ERRORs. Ideally anything that would be logged at INFO should be captured as time-series data.

gkuchta on 3 Apr 2018

👍1

@gkuchta thanks for adding this. You said that you transport your logs over Kafka? Is that with fluentd collecting?

alexellis on 3 Apr 2018

@alexellis I've added a link to https://github.com/rs/zerolog You can see the console output and benchmark there.

stefanprodan on 3 Apr 2018

@alexellis that's correct. container -> fluentd -> kafka -> Elastic Search

gkuchta on 4 Apr 2018

❤1

We touched upon this in the contributors call today.

alexellis on 1 Aug 2018

@alexellis , @ericstoekl, @stefanprodan re:

Reducing log verbosity
Using structured (machine readable) logs

Perhaps I misunderstand, but I think the discussion conflates collecting and presenting the log data.
For collecting the log data, we could use systemd. It's easy to overlook and take for granted, but it's likely to be around for a long time and the binary format avoids the character-encoding hilarity that comes with "text-based" content.

blaise-sumo on 9 Oct 2019

👎1

We want logs to be accessible from kubectl logs, docker logs, stackdriver, cloudwatch, kibana, datadog etc Using the systemd format would make OpenFaaS a non-cloud-native platform :))

stefanprodan on 9 Oct 2019

I've had good success at work with user-go/zap for structured logs. I've not used zerolog but I don't imagine the differences are significant.

jmickey on 16 Oct 2019

Hi @stefanprodan , I think I'm not explaining myself properly.
I agree with your point about accessing the data.
I was referring to recording the data using one of the established libraries like

https://github.com/coreos/go-systemd
https://godoc.org/github.com/ssgreg/journald
Just from browsing the source code, I suspect we could promote good logging discipline if we choose a package and some sensible settings.

blaise-sumo on 16 Oct 2019

Was this page helpful?

0 / 5 - 0 ratings

Related issues

Question: How can I set custom headers on function responses?

ndarilek · 3Comments

Support: Help needed using private Docker registry with Swarm

matthewdolman · 5Comments

Reset the credentials

ohld · 6Comments

FaaS -> OpenFaaS

alexellis · 7Comments

UI: tutorial panel for functions

alexellis · 7Comments