Re: [bitfolk] Support this weekend / Ubuntu Lucid LTS releas…

Top Page

Reply to this message
Author: James Gregory
Date:  
Subject: Re: [bitfolk] Support this weekend / Ubuntu Lucid LTS release
be like so I&#39;ll probably guess wrong.<br>
<br>
- Find some means of using SSDs as a form of tiered storage<br>
=A0=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=
=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=
=3D=3D=3D=3D=3D=3D=3D=3D=3D<br>
<br>
=A0We could continue deploying the majority of your storage from SATA<br>
=A0disks while also employing SSDs to cache these slower disks in<br>
=A0some manner.<br>
<br>
=A0The idea is that frequently-accessed data is backed on SSD whereas<br>
=A0data that is accessed less often is left on the larger-capacity<br>
=A0SATA, and *this remains transparent to the end user*, i.e. the VM.<br>
<br>
=A0This is not a new idea; plenty of storage hardware already does<br>
=A0it, ZFS can do it and so can BTRFS.<br>
<br>
=A0Advantages:<br>
<br>
=A0- For whatever benefit there is, everyone gets to feel it. If done<br>
=A0 =A0right, any VM that needs more IOPs should get more IOPs.<br>
<br>
=A0- Expensive SSDs purchased can be used immediately, in full.<br>
<br>
=A0Disadvantages:<br>
<br>
=A0- Since we can&#39;t use ZFS or expensive storage hardware, any<br>
=A0 =A0short-term solution is likely to be rather hacky. Do we want to<br>
=A0 =A0be pioneers here? This is your data.<br>
<br>
=A0- Customers with VMs that don&#39;t have heavy IO requirements (most)<b=
r>
=A0 =A0will be subsidising those who *do* have heavy IO requirements.<br>
=A0 =A0It&#39;s very unlikely we will put prices up, but SSDs are not free=
<br>
=A0 =A0so it has the effect of delaying the usual progression of<br>
=A0 =A0more-for-less that this type of service goes through.[1]<br>
<br>
=A0- Beyond what might be quite a blunt instrument, customers will<br>
=A0 =A0have no way to request faster storage and rely on it being<br>
=A0 =A0present. You have &quot;some storage&quot; and if that storage isn&=
#39;t<br>
=A0 =A0performing as fast as you would like, all we would be able to do<br=
>

=A0 =A0is try to see why it&#39;s not being cached on SSD.<br>
<br>
- Both?<br>
=A0=3D=3D=3D=3D=3D<br>
<br>
=A0Perhaps there is some way to do both? Maybe to start with using the<br>
=A0whole SSD as cache, but as requests to purchase SSD-backed storage<br>
=A0come in the cache could be reduced?<br>
<br>
=A0Advantages:<br>
<br>
=A0- Again everyone feels the benefit immediately and hardware isn&#39;t<b=
r>
=A0 =A0wasted.<br>
<br>
=A0- If the customer needs to buy SSD-backed storage then they can.<br>
<br>
=A0Disadvantages:<br>
<br>
=A0- If the caching is good enough then no one would feel the need to<br>
=A0 =A0buy SSD anyway, so why add complexity?<br>
<br>
=A0Questionable:<br>
<br>
=A0- If people buy all of the SSD, does that reduce caching benefit<br>
=A0 =A0to zero and suddenly screw everyone else over?<br>
<br>
=A0 =A0Presumably SSD-backed storage could be priced such that if a lot<br=
>

=A0 =A0of people did buy it, it would be economical to go out and buy a<br=
>

=A0 =A0pair of larger ones and swap them over without downtime[2].<br>
<br>
So, if anyone has any thoughts on this I&#39;d be interested in hearing<br>
them.<br>
<br>
If you had an IO latency problem, would you know how to diagnose it<br>
to determine that it was something you were doing as opposed to<br>
&quot;BitFolk&#39;s storage is overloaded but it&#39;s not me&quot;?<br>
<br>
If you could do that, would you be likely to spend more money on<br>
SSD-backed storage?<br>
<br>
If we came to you and said that your VPS service was IO-bound and<br>
would run faster if you bought some SSD-backed storage, do you think<br>
that you would?[3]<br>
<br>
My gut feeling at the moment is that while I would love to be<br>
feeding the geek inside everyone and offering eleventy-billion<br>
choices, demand for SSD-backed storage at an additional cost will be<br>
low.<br>
<br>
I also think it&#39;s going to be very difficult for an admin of a<br>
virtualised block device to tell the difference between:<br>
<br>
=A0 =A0&quot;All my processes are really slow at talking to storage; it&#3=
9;s<br>
=A0 =A0because of my process ID 12345 which is a heavy DB query&quot;<br>
<br>
and:<br>
<br>
=A0 =A0&quot;All my processes are really slow at talking to storage; that&=
#39;s<br>
=A0 =A0definitely a problem with BitFolk&#39;s storage and not anything I<=
br>
=A0 =A0am doing.&quot;<br>
<br>
By the way, I think we&#39;ve done reasonably well at keeping IO latency<br=
>

down, over the years:<br>
<br>
barbar: =A0 =A0<a href=3D"http://tools.bitfolk.com/cacti/graphs/graph_1634_=
6.png" target=3D"_blank">http://tools.bitfolk.com/cacti/graphs/graph_1634_6=
.png</a><br>
bellini: =A0 <a h