Memcached – Caching Beyond RAM: Riding the Cliff (memcached.org)
131 points by Rafuino on Feb 13, 2019 | hide | past | favorite | 19 comments


The way I understand this proposal, the idea is to find the request rate at which the latency of a storage configuration (e.g., 2x Optane) explodes. If your request rate is above that rate, get more RAM. But I'm not sure how to figure out how much more RAM. Keep adding until the latency comes back down, I guess?

Also, what about diurnal patterns? Is there a way to react dynamically and save power?

Interesting reference to The Morning Paper, but I don't get how it fits into the story:

https://blog.acolyer.org/2018/10/26/robinhood-tail-latency-a...


This is well-explored space in more traditional storage land. Look into cumulative distribution functions: they will show you your workload distribution and the effective “working set” to optimize for in a given time period. That gets you to a place where you're optimizing total cost versus performance as a function.
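To make the CDF idea concrete, here is a minimal sketch in Python, assuming you can capture a key-access trace: rank keys from hottest to coldest and compute the cumulative fraction of requests served by the top N keys. The trace and key names are made up for illustration.

```python
from collections import Counter

def working_set_cdf(trace):
    """Given a sequence of accessed keys, return a list of
    (keys_kept, cumulative_fraction_of_requests) pairs, with
    keys ordered from hottest to coldest."""
    counts = Counter(trace)
    total = len(trace)
    cdf = []
    cum = 0
    for i, (_key, n) in enumerate(counts.most_common(), start=1):
        cum += n
        cdf.append((i, cum / total))
    return cdf

# Toy trace: key "a" dominates, plus a long tail of cold keys.
trace = ["a"] * 80 + ["b"] * 10 + ["c"] * 5 + ["d", "e", "f", "g", "h"]
for kept, frac in working_set_cdf(trace):
    print(kept, round(frac, 2))
```

Reading off the curve tells you, for example, that caching only the single hottest key already serves 80% of requests in this toy trace; the knee of the curve is the working set worth keeping in RAM.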

You'll also want to be cognizant of convex “shoulders” in the distribution that will trip up naive optimization algorithms. I don't have the link offhand, but search for hill climbing in relation to CDFs. Some related work might be in the relatively unexplored “cache insertion” problem area. Check out TinyLFU as an example of how knowing what to cache can be more beneficial than knowing what to evict.
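A toy TinyLFU-flavoured sketch of the admission idea (a plain Counter stands in for the count-min sketch with aging that the real design uses): on a miss with a full cache, the newcomer is only admitted if its estimated frequency beats the would-be eviction victim's, so one-hit wonders never displace hot items.

```python
from collections import Counter, OrderedDict

class AdmissionCache:
    """Toy LRU cache with a TinyLFU-style admission filter.
    Not the real algorithm: a Counter replaces the frequency sketch."""
    def __init__(self, capacity):
        self.capacity = capacity
        self.freq = Counter()      # approximate access frequencies
        self.data = OrderedDict()  # LRU order: oldest first

    def get(self, key):
        self.freq[key] += 1
        if key in self.data:
            self.data.move_to_end(key)
            return self.data[key]
        return None

    def put(self, key, value):
        self.freq[key] += 1
        if key in self.data:
            self.data[key] = value
            self.data.move_to_end(key)
            return
        if len(self.data) < self.capacity:
            self.data[key] = value
            return
        victim = next(iter(self.data))  # LRU eviction candidate
        # Admission filter: only displace the victim if the newcomer
        # has been seen more often; otherwise drop the newcomer.
        if self.freq[key] > self.freq[victim]:
            del self.data[victim]
            self.data[key] = value
```

The interesting inversion is in `put`: a classic LRU always admits and evicts, while here the decision is about what to *insert*, which is exactly the “cache insertion” framing above.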

For more advanced techniques, look into some of the published work from places like Coho Data. They had a great paper at USENIX around 2015 on optimizing placement for dynamic workloads across different storage media.

And lastly, experimentation is great for proving a hypothesis, but it's not the most effective discovery tool. You'll want to get representative workload traces and replay/simulate them against different constraints. Check out fio and its I/O trace capabilities for an example.


1) It's unfortunately very workload dependent. Probably the "easiest" way is to set up test systems with enough RAM and disk space, then either slowly reduce the RAM, or start from a low point and slowly increase it until you hit your target; memcached has a command to live-adjust the RAM limit.
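For reference, recent memcached versions expose that live adjustment through the text-protocol `cache_memlimit` command (the server has to support runtime resizing). A minimal sketch of driving it over a raw socket; the host and port here are assumptions:

```python
import socket

def set_memcached_limit(host, port, megabytes):
    """Ask a running memcached to resize its memory limit via the
    text-protocol `cache_memlimit` command. Returns the raw reply,
    which should be "OK" on success."""
    with socket.create_connection((host, port), timeout=5) as sock:
        sock.sendall(f"cache_memlimit {megabytes}\r\n".encode())
        return sock.recv(128).decode().strip()

# Usage (assumes a local memcached listening on the default port):
# print(set_memcached_limit("127.0.0.1", 11211, 2048))
```

This is what makes the "slowly reduce the RAM" experiment practical: you can sweep the limit on a live instance and watch hit rates instead of restarting with new flags each time.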

2) Dunno! For RAM that's very hard. For disk I'm hoping tiering will do something, i.e., add extra NBD space for on-peak, then remove it off-peak.

3) I referenced tail latency because most of the testing focuses on displaying latency outliers and how the system generally minimizes them.


I thought the RAM estimation was: (1) benchmark your backend, (2) benchmark your memcache<->backend path, (3) measure peak traffic performance requirements, (4) provide enough RAM to reduce latency to acceptable levels during peak traffic. If you don't have measurements of the first three, you're shooting from the hip. It is workload dependent, but the more measurements you have, the better you can estimate. (This used to be mandatory so you could do trend analysis and purchase new gear in time for inevitably outgrowing the old stuff.)


Ah, my point of reference was more "how do you go from memcached RAM to RAM + disk?", not "how do you go from no memcached to memcached".

If you don't have any cache layer at all, that's a standard approach.


Wouldn't RAM + disk just be an extension of that method, though? Like in the article: you test a subset of the system (RAM + disk) to find its limits, then compare that against your other measurements and load requirements, and replace RAM with RAM + disk where it's cost- and performance-appropriate. But I might have missed part of the analysis.


The article was significantly cut down for brevity, so any detail of how to find the right balance is beyond the scope :)

The original question was that it can be really hard to determine how much adding RAM will offload from the disk. I.e., with extstore, recent objects are served from RAM and never from disk, so they don't count against your IO budget. If you're coming from a RAM-backed system doing 500k requests per second, it's going to take some creative introspection to figure out how much RAM you'd need to keep disk accesses below 400k, or whatever your target is.
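One way to do that introspection offline, assuming you can capture a key-access trace: replay it through a simulated LRU of a candidate RAM size and count how many requests would miss RAM and fall through to the disk tier, then sweep the size until the miss rate fits your IO budget. This is a generic cache simulation, not extstore's actual accounting.

```python
from collections import OrderedDict

def disk_hit_fraction(trace, ram_items):
    """Replay a key trace through an LRU holding `ram_items` entries
    and return the fraction of requests that miss RAM (and would hit
    the disk tier instead)."""
    lru = OrderedDict()
    misses = 0
    for key in trace:
        if key in lru:
            lru.move_to_end(key)
        else:
            misses += 1
            lru[key] = True
            if len(lru) > ram_items:
                lru.popitem(last=False)  # evict the coldest key
    return misses / len(trace)

# Sweep RAM sizes against a skewed toy trace: 10 hot keys cycled
# 900 times, plus 100 one-off cold keys.
trace = [f"k{i % 10}" for i in range(900)] + [f"cold{i}" for i in range(100)]
for size in (5, 10, 20):
    print(size, disk_hit_fraction(trace, size))
```

If your target were "disk sees under 20% of requests", the sweep tells you which RAM size clears the bar; note how an undersized LRU (5 slots for a 10-key cycle) thrashes and sends everything to disk.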

Along those lines, I just suggested a simple experiment: take an existing fully specced instance, add a disk, and test a few RAM settings to see how it looks. Most actual users of extstore have done just that, since they're extending or replacing an existing cluster.

There are actually other ways: you can simply read the numbers out of memcached, but that takes more effort or familiarity with its internals.


P99 latency at a constant read/write rate is exactly the test that Aerospike's ACT benchmark performs.

https://github.com/aerospike/act

Aerospike has been engineered to perform well in exactly this space: P99 < 8 ms over multiple TB of durable storage under a constant read/write rate.


I believe the objective is to replace expensive RAM with less expensive SSD if your workload is suitable for it. It appears there are workloads for which that is an acceptable cost/latency tradeoff.


Bad or badly configured hardware can add more latency than software, even within the same rack.

An RDMA NIC + NVMe got a 90%+ latency reduction over a no-name NIC + SATA SSD for Alibaba in a large DC environment.


Do you have something more detailed I could read about that?


Not trolling, but a real question:

Under what scenario would memcached be better than Redis in 2019 if you are starting new?


Specifically tied to this discussion: extstore is free in memcached, whereas you need to pay for Redis Enterprise if you want similar functionality (Redis on Flash), as far as I know.


memcached has basically zero configuration, scales to all of your processors, is known/stable/simple, and has never exposed open Lua shells to the internet?

memcached does one thing. redis does a bunch. They are different.


Memcached scales vertically. Redis is single-threaded, so multiple cores are wasted.


The official position is that one should run multiple Redis servers per machine and scale by splitting datasets between them.


Which doesn’t necessarily help if certain shards are hit harder, and it’s more maintenance.

Also, if you are running on-prem, you can optimize your machines for cores vs. RAM. On AWS, at least, you can’t perfectly optimize core count versus RAM versus speed.

But then again, if I were running on-prem, I might use Redis, because it handles use cases by itself that would take memcached plus other servers to manage. With cloud hosting, I could use managed, purpose-built services if I needed more features than memcached can handle by itself.


Not sure if Redis has such a thing, but the failover mechanism through key hashing done by memcached clients (not on the server) is simple and effective (if you have enough nodes).
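What the parent describes is client-side: the memcached client hashes each key onto a ring of server nodes, so when one node dies the client simply remaps that node's keys to the survivors while everyone else's keys stay put. A toy consistent-hashing sketch (the hash function, virtual-node count, and server addresses are all arbitrary illustrative choices, not any particular client's implementation):

```python
import bisect
import hashlib

class HashRing:
    """Toy consistent-hash ring: each key maps to the nearest server
    clockwise; removing a dead node only remaps that node's keys."""
    def __init__(self, nodes, vnodes=100):
        self.vnodes = vnodes
        self.ring = []  # sorted list of (hash, node)
        for node in nodes:
            self.add(node)

    @staticmethod
    def _hash(s):
        return int(hashlib.md5(s.encode()).hexdigest(), 16)

    def add(self, node):
        # Each physical node gets many points on the ring for balance.
        for i in range(self.vnodes):
            bisect.insort(self.ring, (self._hash(f"{node}#{i}"), node))

    def remove(self, node):
        self.ring = [(h, n) for h, n in self.ring if n != node]

    def node_for(self, key):
        h = self._hash(key)
        idx = bisect.bisect(self.ring, (h, "")) % len(self.ring)
        return self.ring[idx][1]
```

The failover property falls out of the structure: dropping a node deletes only its points on the ring, so every key that was mapped elsewhere keeps its old assignment and the cache stays mostly warm.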


Latency matters when you have a lot of round trips, e.g., to databases.



