Behold, a startling tale of awful tech support staff, too little too late management strategies, and verbose tirades of daring do that was supposedly done did! Since January, my virtual private server account with Tektonic has gone down more than a freshman sorority pledge. Every time, I have had to play this two-step swinging game with them to get any sort of response as to why my server was behaving in a way that could only be referred to as "wonky". After the jump, you will read the transcript of the camels back breaking.
If you don't want to read 36 hours worth of support ticket transcripts, the cliff notes version is that Tektonic basically went out of their way to lose my business.
So, I've moved into an account with Linode and while it's still our honeymoon, it's moving along pretty sweet right now. In the roughly 48 hours since the move from Old Busted to New Hotness, the traffic has spiked to approximately 5x the load due to my friend Rick, an editor involved in the news side of the comic book industry, parting ways with his now-former employer for the second time in a year. His site, a WordPress powered site with only mild (read: not aggressive) caching is singlehandedly responsible for said 5x increase in incoming traffic. The new server has not even shrugged under the new load. No sweat has been broken. I might turn up available Apache processes to compensate but seriously, it just doesn't seem to care.
Mon Aug 04 2008 11:20AM by Ryan
IP: (redacted)
Any one want to tell me why my VPS is reporting as "The system is unavailable" in the control panel, again?
This is the second time in the last week this has happened.
Realizing that I could not recieve their emails as my account was now down, I requested a simple follow-up at another address.
Mon Aug 04 2008 11:21AM by Ryan
Client IP: (redacted)
Addendum: since my server is down, my email, again, doesn't work. Please direct all correspondence to (redacted) until this is resolved.
At this point, the ticket was closed upon start up of server, without anyone telling me what happened or what they were doing about it. Approximately 12 hours elapsed after my account came back up, and it went down again.
Tue Aug 05 2008 11:38AM by Ryan
Client IP: (redacted)
My VPS is down and reporting as "The system is unavailable" in the control panel, again. This is the third time since last week.
Is the machine my VPS lives on experiencing hardware problems?
Additionally, why was my ticket from yesterday closed with no correspondence?
Tue Aug 05 2008 11:44AM by support@tektonic.net
Hi, The host is currently down for a file system check as we found file system errors on the node. ETA is 1-2 hours.
That's it. That's the reply I received. Concise, yes. Almost to the point of rudeness. I wanted to ring them up and shout "WELL NO SHIT ASSHOLES, DO YOU KNOW WHAT I DO FOR A LIVING?"
But they don't, so I didn't. I'm classy.
Tue Aug 05 2008 11:48AM by Ryan
Client IP: (redacted)
Why don't you notify customers about these unexpected downtimes then they occur?
Is there a reason these downtimes almost always occur during peak hours?
Was anyone going to tell me that I may have suffered data loss from a potentially dirty file system?
Tue Aug 05 2008 12:02PM by support@tektonic.net
Hi, The outage wasn't planned so a notice could not of been given. No data loss is expected.
For those playing along at home, those are the magic five words. That was when I did a sanity check in my nightly backups and began looking for a new host.
Tue Aug 05 2008 02:51PM by Ryan
Client IP: (redacted)
"No data loss is expected" is not good enough. If I have files being written when this happens, or database tables being flushed while the system is halting, and the sync doesn't happen fast enough or it writes the data to inodes that then get "massaged" by fsck on the way back up, there is a chance, though slim, of data loss.
The outage happened during business hours and it is well within reason to pull a list of affected customers whose accounts are and fire off a quick email to them along the lines of "hey, we dropped the ball and something blew up. again."
Even if I wouldn't haven gotten it, my email server being hosted on the affected VPS, the effort alone would keep me from foaming at the mouth over this.
As it is right now I've been down all day waiting for a filesystem check to finish up for the second time in as many days. Is there a plan to prevent this from happening? Is there a migration route to a more stable server?
Frankly put, and you can forward this to management or sales if you want since you're so adamant about stratification between technical support and business administration (and I cannot seem to get a customer service phone number out of anyone there), what are you doing to keep my business? If the last 6 months of service are an indication, I can't count on your service to be even marginally reliable. I'm not asking for the mythical 5 9's in reliability here; I'm just asking for my account to not be knocked offline two days in a row because some system administrator didn't notice a mess of disk write errors in /var/log/messages.
I cannot get a straight answer from anyone about whatever happened every time my VPS is offline. That's infuriating enough, but this cavalier handling of my data is utterly unacceptable, especially seeing as how you just billed me for another month of this same level of lacking service YESTERDAY. Maybe I'm not pouring thousands of dollars of high end infrastructure dollars into your business, but maybe there is a damned good reason for that.
Tue Aug 05 2008 06:20PM by support@tektonic.net
Hello Ryan,
For major outages we post threads on the forums to track them. In regards to the current one it can found here. http://www.tektonic.net/forum/showthread.php?p=5089&posted=1#post5089 We find this much more efficient due to cases such as yours.
The previous errors you have seen were tied to the bad motherboard that was mentioned in the thread above. This will solve and future occurrences of such problems in the future.
We actively monitor all of our servers for a multitude of problems but things such as this can only have so much preventative measure put in place.
I apologize for any down time you've experienced due to the failed hardware and if there is anything we can do please let us know.
Thanks, Bruce
Tue Aug 05 2008 06:43PM by Ryan
Client IP: (redacted)
I want someone to explain to me the contradictory statements of this support ticket thread and the forum thread, specifically why when I asked explicitly about data loss or file corruption, I received the curt reply of:
"No data loss is expected."
No context, no explanation, nothing more than a terse 5 word answer that provides me with as much assurance as it does information.
Now, having had this forum thread pointed out to me FINALLY, I checked the thread to find the following from Matt Ayres:
"VPS's are starting. There may be file corruption."
I also want to know why no one thought to point out this forum thread when they knew this was going to become an all day affair. I've been waiting since 2:00pm for what I thought was just an fsck to complete.
Also, why isn't my VPS running yet if the VPSes were coming back up as of what looks to be 6:30pm EST from the forum? The control panel still lists my VPS status as "The system is unavailable".
Tue Aug 05 2008 07:40PM by support@tektonic.net
Hello,
Initially it was believed to be a simple issue with the raid card that just needed replacing. At the time of comment we had no indication of possible data loss.
After swapping the card it became apparent that it was indeed a motherboard issue.
In regards to the last part of your email there was more corruption and errors present then we had anticipated. Starting the VPS's began ok but was not stable.
Thanks, Bruce
I am officially spitting blood at this point. But it's ok, I've made up my mind and started moving ahead. Nothing to do now but log into the server that is finally up, and start moving over whatever incidentals I might have missed in my nightly backup, right?
Wed Aug 06 2008 10:47AM by Ryan
Client IP: (redacted)
I see my server is down, again. That's ok though. After being fed up with the fiasco yesterday, I moved into a new account last night, at more than twice the price for a little less than double the resources, with a company who promised me, explicitly, that they could provide a more stable account than you. They had me at "we can do better than tektonic" and the price never even entered into it.
It didn't help your case any that my databases, file system, and in some cases system binaries were all corrupted by the errant, but recurring, hardware problems. Fortunately for all involved (but no one more so than me) I smelled this coming three weeks ago and started rsyncing the important data and nightly database backups off the server the last time there was an extended outage.
I'm canceling my account, and I'll be contacting sales to request a full refund of the payment posted for early august as I've been unable to use my account thus far, and have no intentions of continuing to use it any longer. If you can expedite that process, wonderful. If not, well, that doesn't seem to be any different from any other experience I've had with your support.
I'm honestly amazed how, looking back at the year on and off between two accounts I spent with Tektonic at how often my VPS was down, unavailable, or unpredictable. Looking at your server uptime report, it kills me to see how my account invariably winds up placed on the least reliable server in your farm.
No Comments