CARVIEW |
Jesse Robbins

Jesse Robbins is passionate about Infrastructure, Emergency Management, and technology that helps people be safe, happy, and free. He serves as co-chair of the Velocity Performance & Operations Conference and is part of the O’Reilly Radar. Jesse currently advises companies in Seattle and San Francisco. He previously worked at Amazon.com where his title was “Master of Disaster” and where he was responsible for Website Availability. Jesse is a volunteer Firefighter/EMT & Emergency Manager, and led a task force deployed in Operation Hurricane Katrina.
Thu
Aug 7
2008
Kaminsky DNS Patch Visualization
Dan Kaminsky has posted the details of the widespread DNS vulnerability. Clarified Networks created this visualization of DNS patch deployment over the past month:
Red = Unpatched
Yellow = Patched, "but NAT is screwing things up"
Green = OK
tags:
| comments: 4
| Sphere It
submit:
Sat
Jun 28
2008
The new internet traffic spikes
Theo Schlossnagle, author of Scalable Internet Architectures, gave a great explanation of how internet traffic spikes are shifting:
Lately, I see more sudden eyeballs and what used to be an established trend seems to fall into a more chaotic pattern that is the aggregate of different spike signatures around a smooth curve. This graph is from two consecutive days where we have a beautiful comparison of a relatively uneventful day followed by long-exposure spike (nytimes.com) compounded by a short-exposure spike (digg.com):The disturbing part is that this occurs even on larger sites now due to the sheer magnitude of eyeballs looking at today's already popular sites. Long story short, this makes planning a real bitch.
[...]What isn't entirely obvious in the above graphs? These spikes happen inside 60 seconds. The idea of provisioning more servers (virtual or not) is unrealistic. Even in a cloud computing system, getting new system images up and integrated in 60 seconds is pushing the envelope and that would assume a zero second response time. This means it is about time to adjust what our systems architecture should support. The old rule of 70% utilization accommodating an unexpected 40% increase in traffic is unraveling. At least eight times in the past month, we've experienced from 100% to 1000% sudden increases in traffic across many of our clients.
[Link]
tags:
| comments: 5
| Sphere It
submit:
Tue
Jun 24
2008
Video of Rich Wolski's EUCALYPTUS talk at Velocity
Rich Wolski gave a truly impressive talk at Velocity about an open-source software infrastructure for cloud computing called EUCALYPTUS . The API is compatible with Amazon's EC2 interface, and the underlying infrastructure is designed to support multiple client-side interfaces. EUCALYPTUS is implemented using commonly-available Linux tools and basic Web-service technologies making it easy to install and maintain. Watch and learn...
You can see more videos from Velocity on Blip.tv.
tags: cloud computing, ec2, open source, utility computing, velocity, velocity08
| comments: 0
| Sphere It
submit:
Mon
Jun 23
2008
Hyperic CloudStatus service dashboard launches at Velocity!
Javier Soltero just launched CloudStatus during his Hyperic sponsor session today at Velocity. CloudStatus is a public health dashboard for web services like Amazon's EC2/S3, and Google's App Engine.
Javier called to tell me about this last week after I declared that "Service Monitoring Dashboards are mandatory". This comes right after Amazon and Google had visible outages, and couldn't have happened at a better time. I'm really excited to see this idea take off, as it's something that is critical to the broad adoption of web services and cloud computing.
tags: cloudstatus, hyperic, monitoring, outages, velocity, velocity08, webops
| comments: 6
| Sphere It
submit:
Wed
Jun 18
2008
code_swarm - visualizing the life of open source
code_swarm was created by Michael Ogawa with Processing.
This visualization, called code_swarm, shows the history of commits in a software project. A commit happens when a developer makes changes to the code or documents and transfers them into the central project repository. Both developers and files are represented as moving elements. When a developer commits a file, it lights up and flies towards that developer. Files are colored according to their purpose, such as whether they are source code or a document. If files or developers have not been active for a while, they will fade away. A histogram at the bottom keeps a reminder of what has come before.
(thanks to Todd Ogasawara for pointing this out!)
tags: code, code swarm, infovis, open source, processing, python
| comments: 6
| Sphere It
submit:
Tue
Jun 17
2008
Service Monitoring Dashboards are mandatory for production services!
Google App Engine went down earlier today. GAE is still a developer preview release, and currently lacks a public monitoring dashboard. Unfortunately this means that many people either found out from their app and/or admin consoles being unavailable or from Mike Arrington's post on TechCrunch.
Google has a strong Web Operations culture, and there are numerous internal monitoring tools in use across the company, along with a smaller set available to customers. It's suprising that Google launched a developer platform without providing something beyond an email group, although they are by no means the first to do so.
Service Monitoring Dashboards are mandatory for production services and platforms!
- If you launch a platform that people pay you money for, you need to have a real time service dashboard. Ideally this should be decoupled from the rest of your infrastructure.
- Don't rely on platforms that lack service monitoring dashboards for production.
Many companies are initially reluctant to provide this kind of monitoring to the public, and only do so in reaction to an outage. However, it seems that every company that offers such a dashboard uses it as a source of competitive advantage.
The best example of this is trust.salesforce.com which they launched after series of outages in 2006. Amazon (eventually) launched a status dashboard for AWS, and added RSS feeds for specific services which I think is pretty cool.
Javier Soltero at Hyperic points out
1. The reports of service outages arrive long after anyone who depends on the services can possibly do anything to mitigate their effect.
2. The services themselves seem incapable of providing any visibility into the circumstances that might lead to future outages.[...]Even TechCrunch points out that the Google Apps blog doesn’t even mention the outage. Other clouds rely on blogs such as this one, this one, or maybe even this one (from our good friends at Mosso). These are all places where outages can be discussed, but not the right means for people to find out whether it their application that crashed, or the cloud that it depends on.
(Updated:Niall Kennedy pointed out that GAE is still a preview release, and I agree that my original wording was wrong. My intent is to emphasize the importance of providing a public service dashboard and so I've edited accordingly.)
tags: failure happens, google app engine, infrastructure, monitoring, operations, outages, platforms, saas, web services, webops
| comments: 6
| Sphere It
submit:
Tue
Jun 17
2008
Two new open source projects at Velocity
At Velocity next week there will be two significant open source projects debuting. The first is the Jiffy: Open Source Performance Measurement and Instrumentation tool created by Scott Ruthfield and his team at Whitepages.com.
Most tools for measuring web performance come in two flavors:
- Developer-installed tools (Firebug, Fiddler, etc.) that allow individuals to closely trace single sessions
- Third-party performance monitoring systems (Gomez, Keynote, etc.) that will hit your site occasionally and report back component-level metrics (for a fee)
Neither of these tools give you real-world information on what’s actually happening with your clients—how long are pages really taking to load, what’s the real cost of client-side execution, and what’s the impact of your loading or dependency chain. This is even more important when you don’t host all of your own assets, such as when you load ads or JavaScript from third parties, for example, and you need to monitor their performance.
Thus we built Jiffy—an end-to-end system for instrumenting your web pages, capturing client-side timings for any event that you determine, and storing and reporting on those timings. You run Jiffy yourself, so you aren’t dependent on the performance characteristics, inflexibility, or costs of third-party hosted services.
The second is project is EUCALYPTUS, the Elastic Utility Computing Architecture for Linking Your Programs To Useful Systems, presented by Rich Wolski from UCSB. This project has already started getting attention. (Many thanks to Surj Patel of Structure08/GigaOM for connecting us!)
Eucalyptus is an open-source software infrastructure for implementing "cloud computing" on clusters. The current interface to EUCALYPTUS is compatible with Amazon's EC2 interface, but the infrastructure is designed to support multiple client-side interfaces. EUCALYPTUS is implemented using commonly-available Linux tools and basic Web-service technologies making it easy to install and maintain.
The talk will focus on the design, the implementation tradeoffs we have identified in implementing Eucalyptus as an exploratory tool, and the ways in which we have chosen to address these tradeoffs in the first version of the software.
tags: cloud, cloud computing, ec2, gomez, jiffy, keynote, metrics, performance, structure08, velocity, velocity08, web monitoring, webops
| comments: 3
| Sphere It
submit:
Sat
Jun 14
2008
Understanding Web Operations Culture (Part 1)
“You don’t choose the moment, the moment chooses you. You only choose how prepared you are when it does.” - Fire Chief Mike Burtch
(Note: I became a Firefighter-1 and EMT in 2000. My experiences in the fire service profoundly influence my efforts in technology. Much of my work over the past few years has been translating and distilling my knowledge from these two worlds, teaching others, and finding ways to apply it in the service of both.)
Last week I came upon a truck vs. scooter accident on my way home. I could hear a woman yelling in pain from underneath the truck (a good sign!) and could see a guy in the cab looking panicked and touching his controls. I stopped my car and “surveyed the scene” looking for things that might kill me (traffic, hazmat, downed power lines) or make the situation worse if undetected (additional victims, deflating tires, fires).
It looked like the driver was about to move his truck, which would have definitely made things worse. I used my ‘command voice’ to yell “Put it in park! Stop your engine! Set your brake! Get out and wait!” as I approached the truck.
A city crew came over, and one of them told me “We’ve called 911 and they are on their way.”
I asked them to handle traffic control as I approached my patient. I then introduced myself and asked her if I could help. (I have to obtain consent before assisting an injured person, and a response means I know they have still have their Airway, Breathing, and Circulation intact.)
Her legs were entangled in her scooter which was trapped underneath the truck. While she probably had broken her leg, it didn’t look all that bad. She was still wearing her helmet and it wasn't seriously damaged which meant her head was probably okay too. I did a quick check for bleeding and other serious injuries and did a “mental status check” by asking her name, where she was (“on my way to school”), and what had happened (“I was riding and that a**hole RAN OVER ME!”). This meant she was alert and oriented, which was good.
Now that I was sure there weren’t any other life threatening injuries, I prepared to hold her head for c-spine stabilization. (Once you start holding stabilization, you cannot move again until you are ready to put the patient on a backboard.)
As I positioned myself on the ground and took hold of her head, I explained “I’m going to hold your head now to protect your neck and back. Once the fire department gets here, they are going to get your legs unstuck and then we’ll get you on a backboard. Your job is to keep still and keep talking to us. There will be a lot of commotion and noise around you, and that’s okay. Everyone will be watching out for you and so there is no reason to be scared. We’ve got you.”
tags: culture, ems, executive, firefighting, leadership, management, medicine, operations, velocity, velocity08, webops
| comments: 11
| Sphere It
submit:
Fri
Jun 13
2008
CloudCamp gathering after Velocity
On Tuesday after Velocity closes there will be a CloudCamp gathering at Microsoft's San Francisco Office. I'll be going (unless I'm too exhausted to stand).
CloudCamp was formed in order to provide a common ground for the introduction and advancement of cloud computingThrough a series of local cloudcamp events, attendees can exchange ideas, knowledge and information in a creative and supporting environment, advancing the current state of cloud computing and related technologies. As an informal, member-supported gathering, we rely entirely on volunteers to help with meeting content, speakers, meeting locations, equipment and membership recruitment. We also have corporate sponsors that provide financial assistance with venues, software, books, discounts, and other valuable donations. To become a member, simply register for an upcoming event. Anyone may attend a meeting, there are no fees or dues.
It looks like there is now a London CloudCamp being planned for July 16th as well.
(PS: If you still haven't registered for Velocity and want to attend, you can use my 20% discount code "vel08js".)
tags: barcamp, cloud, cloud computing, cloudcamp, ec2, performance, velocity, webops
| comments: 2
| Sphere It
submit:
Thu
Jun 12
2008
BarCampBank is spreading
When Ben Black and I organized the first BarCampBank in North America last year, we hoped that it would spread. According to William Azaroff's post on NetBanker, the movement is there and growing:
What's all this about BarCampBanks? From a North American premiere in Seattle almost a year ago, we've witnessed two more in the last few months, and eight more are either scheduled, or in the planning process.Well, maybe not exactly “planned.” BarCampBanks emerge more than they are planned.
[...]It started as a technology summit, an un-conference where developers and technology geeks could share exploits, connect, and find like-minded companions to extol the virtue of open-source and emerging technologies over pizza and wine.
And then someone decided that this forum would be a perfect place to talk about banking and finance. Weird. And yet it works.
The next event will be BarCampBankDallas on June 21-22nd at the American Bank of Texas Building in Frisco, Texas. William has details on the other events this year on his blog, and a current list can always be found on the main BarCampBank wiki.
tags: barcamp, barcampbank, disruption, finance, innovation, open space
| comments: 1
| Sphere It
submit:
Recent Posts
- Bill Coleman to keynote Velocity on June 11, 2008
- TLS Report grades and reports on site security on June 9, 2008
- DisasterTech from Where2.0 on May 30, 2008
- Ignite! @ Velocity: Submit your talks & "war stories"... on May 29, 2008
- Burmese translation of Sahana complete! on May 16, 2008
- Automated Infrastructure Podcast on IT Conversations on May 13, 2008
- Structure and Velocity on May 10, 2008
- Disaster Technology for Myanmar/Burma aid workers on May 8, 2008
- You Become what You Disrupt - (part two) on April 13, 2008
- Velocity preview at Web2.0 Expo on April 10, 2008
BUSINESS INTELLIGENCE
RELEASE 2.0
Current Issue

Velocity: Web Operations & Performance
Issue 2.0.9
Back Issues
More Release 2.0 Back IssuesCURRENT CONFERENCES

RailsConf Europe is the largest gathering of the European Ruby on Rails community. This is your chance to meet, connect, and collaborate with other Rails programmers and developers. Read more

New York has long been where the world's biggest industries go online, and as Web 2.0 grows up and gets serious, the time is right to convene the East Coast web communities under the umbrella of the next generation web. Read more
O'Reilly Home | Privacy Policy ©2005-2008, O'Reilly Media, Inc. | (707) 827-7000 / (800) 998-9938
Website:
| Customer Service:
| Book issues:
All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners.