Amazon details cause of AWS outage that hobbled thousands of online sites and services


A “relatively small addition of capacity” to the Amazon Kinesis real-time data processing service triggered a widespread Amazon Web Services outage last week, the company said in a detailed technical analysis over the weekend.

The addition of the new capacity “caused all of the servers in the fleet to exceed the maximum number of threads allowed by an operating system configuration,” Amazon said, describing a cascade of resulting problems that took down thousands of sites and services.
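As a rough illustration of how a small capacity addition can push every server over a thread limit at once, the sketch below assumes a design in which each server keeps one thread per peer in the fleet. The per-peer design, the fleet sizes, the baseline thread count and the limit value are all hypothetical, chosen for illustration rather than taken from this article.

```python
def threads_per_server(fleet_size: int, base_threads: int = 200) -> int:
    """Threads one server needs if it keeps one thread per peer (assumed design)."""
    return base_threads + (fleet_size - 1)


def servers_over_limit(fleet_size: int, thread_limit: int, base_threads: int = 200) -> int:
    """Count servers whose thread need exceeds the OS-configured limit."""
    needed = threads_per_server(fleet_size, base_threads)
    # Every server has the same need, so either none or all exceed the limit.
    return fleet_size if needed > thread_limit else 0


THREAD_LIMIT = 10_000  # hypothetical operating system configuration value

before = servers_over_limit(9_700, THREAD_LIMIT)  # 200 + 9,699 = 9,899, under the limit
after = servers_over_limit(9_900, THREAD_LIMIT)   # 200 + 9,899 = 10,099, over the limit
print(before, after)
```

Under these assumptions, growing the fleet by about 2% moves the number of affected servers from zero to the entire fleet, which is consistent with the all-at-once failure the company described.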

The outage affected online services from big tech companies such as Adobe, Roku, Twilio, Flickr, Autodesk and others, as well as New York City’s Metropolitan Transportation Authority. The Washington Post, which is owned by Amazon CEO Jeff Bezos, was also affected.

It was an especially ill-timed incident for Amazon, coming just days before its annual AWS re:Invent cloud conference. Reliability has


Amazon Web Services explains Wednesday outage

Within a few hours, the malfunctions began hitting customers of Amazon Web Services, the company’s cloud-computing unit. Customers of the Amazon-owned Ring security camera service couldn’t log in or watch video. Users struggled to operate their iRobot vacuum cleaners because the outage affected the iRobot Home app. And media companies, including The Washington Post (owned by Amazon founder and chief executive Jeff Bezos), experienced publishing system outages.

Amazon acknowledged that the system failure was exacerbated by the dependencies its various services have on one another. The company had been trying to add capacity to its Amazon Kinesis service, which customers use to process real-time data, including video, audio and application logs. To resolve the issue, Amazon needed to restart a piece of its system it described as “many thousands of servers,” a lengthy process that had to be done gradually. But because other Amazon cloud services rely on Kinesis, including


Amazon Web Service outage takes down major apps and sites


A prolonged outage of Amazon Web Services — a core component for a vast number of sites and apps — brought part of the internet to a halt Wednesday, as reported earlier by The Verge. The affected sites include not only major players on the web like Flickr, Adobe Spark and Roku, but at least three news outlets.

The Tampa Bay Times, The Philadelphia Inquirer and the Capital Gazette all took to Twitter to inform readers of their AWS-related downtime. 


Amazon Web Services outage subsides after unplugging services for hours


The AWS status dashboard perhaps isn’t the most reliable source of information about the outage at the minute — Amazon says the problem has affected its ability to provide updates there. As of noon ET, the dashboard reported “The Kinesis Data Streams API is currently impaired in the US-EAST-1 Region. As a result customers are not able to write or read data published to Kinesis streams.”

Update (4:58 PM ET): The AWS dashboard still notes errors on a number of services, while users still have some issues on the various AWS-powered platforms. A full resolution is still likely hours away, as the last update at 3:15 PM ET said “We continue to work towards recovery of the issue affecting the Kinesis Data Streams API in the US-EAST-1 Region. We also continue to see an improvement in error rates for Kinesis and several affected services, but expect full recovery to


Amazon Web Services outage now over, hit Roku, Adobe, Shipt

  • Amazon Web Services has recovered after a massive outage on Wednesday.
  • The outage only affected one of AWS’s 24 regions, its US-East-1 Region, but it took down many popular web-based services.
  • Roku, Adobe, and Target-owned Shipt were among the sites impacted by the outage.

Amazon Web Services (AWS) is back up after being down for most of Wednesday.

“We have restored all traffic to Kinesis Data Streams via all endpoints and it is now operating normally,” the company said early Thursday morning.

Scores of websites rely on AWS’s internet infrastructure to function. The outage affected only one of AWS’s 24 regions, its US-East-1 Region, but it took down many popular web-based services that use its servers. Roku, Flickr and Adobe Spark were among those that reported issues. The Verge first reported the outage.


Amazon Web Services outage hits internet and app functions




A multi-hour glitch in Amazon Web Services (AWS) is affecting several websites and apps that depend on the service.

At 5.25 PM ET on Wednesday (3.55 AM IST Thursday) Amazon said recovery might “still be a few hours away”.

“We continue to work towards recovery of the issue affecting the Kinesis Data Streams API in the US-EAST-1 Region. We also continue to see an improvement in error rates for Kinesis and several affected services, but expect full recovery to still take up to a few hours,” a statement on the AWS Service Health Dashboard read.

Later, at 7.42 PM ET (6.12 AM IST) it said there were “steady signs of recovery.”

Amazon told The Verge that while the outage is only affecting one of its 23 geographic AWS regions, it has affected “a large number of internet services.”


Amazon Web Services outage hobbles businesses

SEATTLE —Amazon’s widely used cloud computing service suffered a major outage in its eastern U.S. operations Wednesday, hampering everything from services for Web-connected security cameras to software applications that businesses use to design products.


Starting midmorning, a variety of Amazon Web Services applications began to fail, including ones that deliver data and authorize users to access that data. That hobbled companies that rely on AWS, including the Amazon-owned Ring security camera service, iRobot’s Roomba vacuum cleaner app, services from design technology firm Autodesk and the publishing systems of news outlets such as The Washington Post (which is owned by Amazon founder and chief executive Jeff Bezos).

AWS is the world’s largest provider of cloud-computing services, which let customers rent data storage and processing capabilities over the Web instead of running their own data centers. AWS, which competes in the business of providing computing infrastructure via the cloud against


Amazon Web Services outage causes issues for Roku, Adobe

  • Amazon’s cloud-computing unit, Amazon Web Services, suffered an outage Wednesday.
  • Roku, Adobe and other services that rely on AWS reported some issues with their websites.
  • Major clients like Apple and Slack appeared to be unaffected.


Amazon’s cloud-computing service on Wednesday was hit with an outage that took down some websites and services.


A notice on Amazon Web Services’ status page said it was experiencing problems with Kinesis, its service that processes large streams of data, causing “increased error rates” for a number of websites. The outage also impacted its ability to post updates to the status page.

“We continue to work towards recovery of the issue affecting the Kinesis Data Streams API in the US-EAST-1 Region,” according to a notice, which was posted late afternoon on Wednesday. “We also continue to see an improvement in error rates


Prolonged AWS outage has taken down a big chunk of the internet

Amazon Web Services (AWS), Amazon’s internet infrastructure service that is the backbone of many websites and apps, has been experiencing a multi-hour outage on Wednesday that is affecting a large portion of the internet. And a full recovery might still be a few hours away, according to the company’s latest update.





“We continue to work towards recovery of the issue affecting the Kinesis Data Streams API in the US-EAST-1 Region,” Amazon said in a 3:15PM ET statement on its AWS Service Health Dashboard. “We also continue to see an improvement in error rates for Kinesis and several affected services, but expect full recovery to still take up to a few hours.” In a 5:25PM ET update to the dashboard, the company reiterated that a full recovery may take up to a few hours.


In an email to The Verge,


Amazon Web Services experiencing an outage that’s affecting a number of popular services like Roku, Adobe, and Target-owned Shipt




  • Amazon Web Services is experiencing an outage, with Down Detector reporting over 1,000 user complaints.
  • Many websites rely on the company’s internet infrastructure service to operate.
  • Roku, Adobe, and Target-owned Shipt are among the sites reporting on Wednesday that the outage is affecting their services.

Amazon Web Services has been down for most of Wednesday.  The Verge first reported the outage. 

In a statement to Business Insider, an Amazon spokesperson said, “Kinesis has been experiencing increased error rates this morning in our US-East-1 Region that’s impacted some other AWS services. We are working toward resolution.” 

Scores of websites rely on the internet infrastructure to function. The outage only affected one of AWS’s 23 regions, but it took down many popular web-based services that utilize its servers, like Roku and Adobe Spark, which were among those that
