|
Progressive JPEG posted:avoid counters imo counters are fine if they reset on a boundary like 30 seconds. then you can get a rate
|
# ¿ Feb 25, 2019 19:03 |
|
|
# ¿ May 14, 2024 00:26 |
|
also we write all our metrics to a kinesis stream and consume them elsewhere to put them in an influxdb database, which is queried using grafana. doesn't have any perceptible overhead.
|
# ¿ Feb 25, 2019 19:04 |
|
uncurable mlady posted:influxdb more like refluxdb because it gives you heartburn trying to run it i fired it up and its been running since, seems braindead simple?
|
# ¿ Feb 25, 2019 21:25 |
|
r u ready to WALK posted:influx is easy until you start trying to customize it with rollup policies, multiple retention schedules and other things that aren't part of the default settings yeah we don't use the real-time metrics for anything than just monitoring. so we set the retention policy to 3 months and call it a day. we have a few metrics that are condensed from a continuous query but they never have a problem? only issue we ever had was when someone decided to use a random GUID as a tag and polluted a DB but it wasn't a big deal, i just deleted them.
|
# ¿ Feb 25, 2019 23:20 |
|
opentracing is cool
|
# ¿ Apr 29, 2019 00:56 |
|
you want tracing, not just metrics.
|
# ¿ Jul 9, 2019 17:09 |
|
we use the opentracing api for everything and use jaeger as our backend which feeds it into elasticsearch its cool when someone used a constant sampler (samples every trace) and we were producing 10gb of traces each day in our test environment. pro-tip: use a probabilistic sampler or something that says "sample 5% of traces" instead.
|
# ¿ Jul 9, 2019 23:33 |
|
a good idea is to make it configurable at run-time so you can adjust it in cases liek that
|
# ¿ Jul 10, 2019 22:20 |
|
no i mean, that it can be adjusted. like you have some sort of external way of updating the values. that way you can just send a command to reduce the sample rate
|
# ¿ Jul 10, 2019 22:41 |
|
my stepdads beer posted:vector.dev looks nice it looks neat but also they have these performance comparisons and they aren't even close to feature complete, yet.
|
# ¿ Jul 24, 2019 14:32 |
|
I'm picturing someone calling a customer service rep with billing questions and the person on the line saying "hold on for one second, sir" and then SSHing into a server and grepping logs
|
# ¿ Jun 12, 2020 16:12 |
|
|
# ¿ May 14, 2024 00:26 |
|
hell yeah
|
# ¿ Jul 28, 2020 16:53 |