Welcome!

@DevOpsSummit Authors: Elizabeth White, Liz McMillan, Zakia Bouachraoui, Yeshim Deniz, Pat Romanski

Related Topics: @DevOpsSummit, Linux Containers, Containers Expo Blog

@DevOpsSummit: Blog Feed Post

NASA’s Juno Mission and IT Operations | @DevOpsSummit #APM #DevOps #ContinuousTesting

How does all this relate to human problems of operating an ITOps environment?

NASA's Juno Mission and IT Operations
By Ophir Ronen

I've always wanted to be a starship pilot traveling the stars. While there is a slim chance of interstellar travel happening in my lifetime, we are starting to enter a fascinating era. We're doing incredible things like landing on cometstesting ion engines, and even exploring EM-drives. What's especially exciting right at this moment is humanity placing a probe around one of the most intense environments in the solar system - orbiting Jupiter.

Harsh Environments
The Juno spacecraft has to deal with an incredibly harsh environment. The biggest challenge is the intense radiation - 20,000 greater than Earth's - which Juno will not survive but rather contend with for a brief time. "Once these electrons hit a spacecraft, they immediately begin to ricochet and release energy, creating secondary photons and particles, which then ricochet," Heidi Becker, leader of Juno's radiation-monitoring team, said during a news conference last month. "It's like a spray of radiation bullets."

Why am I bringing up the Jupiter mission in the context of IT Operations? How does all this relate to human problems of operating an ITOps environment? The answer is simple - both pose harsh environments that require planning, well-defined processes, and appropriate tooling in order to endure and thrive. The IT Operations version of a spray of radiation bullets is the at times overwhelming flood of non-actionable and actionable alerts flowing in from the various management systems.

In the past, we called these non-actionable alerts "noise" but we're moving away from that nomenclature as we're discovering golden nuggets of leading and trailing edge indicators in the sea of IT Operations alert data.

Alert Suppression
When my former company, Event Enrichment HQ, was acquired by PagerDuty late last year, the expectation was to augment the existing excellent array of incident response capabilities with event management focused enhancements. We initiated this effort by creating our PagerDuty common event format (PD-CEF) with which we normalize and structure alerts from your management systems. By doing so, we set the stage with which to build new and powerful tools to help you accelerate incident response. Building on that solid foundation of normalized event data, our new event rules engine allows you to classify groups of alerts and to act on them, starting with event and alert suppression. Alert suppression is necessary as our philosophy to deal with the enormous load of alerts generated by today's infrastructure is not to drop them but instead suppress them.

Why suppress alerts you ask? Our research has shown that many of those so-called "noise" alerts are leading edge indicators to much more severe issues. By sending in more events rather than less to PagerDuty, you will gain a much deeper and more profound understanding of the event flows and alert clusters in your IT Infrastructure using our new IT Operations visualization tools.

The Future
As you will see at PagerDuty Summit, these enhancements to PagerDuty's core offering will go far beyond what you have seen from us thus far. We are intensely focused on providing you the tooling with which to give you a deeper understanding and specific context to issues and incidents which impact your company.

Now a year in after the acquisition, I'm excited to report that PagerDuty has undergone an evolutionary leap into the future. We have always and will continue to embrace lean and agile methodology as per Tim's earlier post; we're focused on learning and empathy as described by Jonny; and we're creating a profound fusion of event management (data) and incident management (people) capabilities. These are heady times here at PagerDuty.

We're now T-1 week away from PagerDuty Summit where we'll kick off this wild ride and introduce you to all of these new capabilities. If you join us at The Village on Sept 13th, you will get to experience it first hand. I'm looking forward to seeing you there!

Referenced articles:

The post NASA's Juno Mission and IT Operations appeared first on PagerDuty.

Read the original blog entry...

More Stories By PagerDuty Blog

PagerDuty’s operations performance platform helps companies increase reliability. By connecting people, systems and data in a single view, PagerDuty delivers visibility and actionable intelligence across global operations for effective incident resolution management. PagerDuty has over 100 platform partners, and is trusted by Fortune 500 companies and startups alike, including Microsoft, National Instruments, Electronic Arts, Adobe, Rackspace, Etsy, Square and Github.

@DevOpsSummit Stories
Dynatrace is an application performance management software company with products for the information technology departments and digital business owners of medium and large businesses. Building the Future of Monitoring with Artificial Intelligence. Today we can collect lots and lots of performance data. We build beautiful dashboards and even have fancy query languages to access and transform the data. Still performance data is a secret language only a couple of people understand. The more business becomes digital the more stakeholders are interested in this data including how it relates to business. Some of these people have never used a monitoring tool before. They have a question on their mind like "How is my application doing" but no idea how to get a proper answer.
Enterprises are universally struggling to understand where the new tools and methodologies of DevOps fit into their organizations, and are universally making the same mistakes. These mistakes are not unavoidable, and in fact, avoiding them gifts an organization with sustained competitive advantage, just like it did for Japanese Manufacturing Post WWII.
DevOpsSUMMIT at CloudEXPO, to be held June 25-26, 2019 at the Santa Clara Convention Center in Santa Clara, CA – announces that its Call for Papers is open. Born out of proven success in agile development, cloud computing, and process automation, DevOps is a macro trend you cannot afford to miss. From showcase success stories from early adopters and web-scale businesses, DevOps is expanding to organizations of all sizes, including the world's largest enterprises – and delivering real results. Among the proven benefits, DevOps is correlated with 20% faster time-to-market, 22% improvement in quality, and 18% reduction in dev and ops costs, according to research firm Vanson-Bourne. It is changing the way IT works, how businesses interact with customers, and how organizations are buying, building, and delivering software.
This is going to be a live demo on a production ready CICD pipeline which automate the deployment of application onto AWS ECS and Fargate. The same pipeline will automate deployment into various environment such as Test, UAT, and Prod. The pipeline will go through various stages such as source, build, test, approval, UAT stage, Prod stage. The demo will utilize only AWS services including AWS CodeCommit, Codebuild, code pipeline, Elastic container service (ECS), ECR, and Fargate.
The current environment of Continuous Disruption requires companies to transform how they work and how they engineer their products. Transformations are notoriously hard to execute, yet many companies have succeeded. What can we learn from them? Can we produce a blueprint for a transformation? This presentation will cover several distinct approaches that companies take to achieve transformation. Each approach utilizes different levers and comes with its own advantages, tradeoffs, costs, risks, and outcomes.