Welcome!

@DevOpsSummit Authors: Automic Blog, Liz McMillan, Elizabeth White, Yeshim Deniz, PagerDuty Blog

Blog Feed Post

Thinking About APM? 4 Key Considerations for Buy vs. Build Your Own

image_pdfimage_print

Most technology folks have heard Marc Andreessen’s provocative statement, “Software is eating the world.”  Whether you agree fully or not, you’re realizing that your business critical software applications increasingly drive both the top-line revenue growth and the bottom-line operational efficiency of your company – and often form the pillar of your business identity.  

Legacy monitoring systems you have in place, capturing and alerting on scores of infrastructure level metrics, have helped protect your technology investment to some degree.  I’ve worked for two leaders in that space, HP and BMC, so can personally attest to the real benefits of server, network and database monitoring that clients achieve.  However, as consumer demand for superior services and faster innovation accelerate, we see that the applications and associated business transactions are what end-users ultimately care about.  You can no longer afford slow response time, let alone application outage situations, as customers will delay or abandon purchases – or worse yet switch to a competitor when you have unacceptable application performance.  And unfortunately these “Yellow Light” or slow performance situations are the most challenging to detect and fix!

So you’ve come to the conclusion that you need a full-fledged application performance management (APM) solution.  The question some companies wrestle with at this point is: “Should we invest in an 3rd party APM solution or build it ourselves?”

Four key considerations should be:

  1. Upfront Costs – such as Initial Project Build & Software License cost.

  2. Ongoing, Annual Solution Costs – such as Server / Storage footprint, administrative maintenance & support, & agile development / release activities.

  3. Solution Capabilities Driving Benefits – chiefly, the ability to drive down the number of performance defects in production, as well as the MTTR when issues do occur.

  4. Opportunity Costs – personnel resources working on in-house APM, versus are there mature 3rd party APM solutions available for purchase.

1.  Upfront Costs

It’s difficult to estimate exactly how long it would take a company to develop a basic application monitoring tool in-house – but we’ll give it a logical shot.  Of course, on the plus side, the company would avoid spending money on a “commercial off-the-shelf” (COTS) 3rd party software application.  Based on experience for design, development, testing, and release, a good estimate for an in-house Initial Project Build is a team of 2-3 Engineers about 6 months to have a basic, log parsing and alerting tool ready.  A more robust tool for a medium to large sized deployment may be 2-3x this size and investment.  A gaming company we work with, when assessing an in-house build situation, estimated an APM product development lifecycle in the 12 to 18 month range.  Why?  APM functionality that involves tracing the user experience of distributed transactions, where every call needs to be traced across each service layer, is non-trivial technical work.  Also, you’ll need to factor in one-time hardware and prerequisite software purchasing costs.  So a ballpark cost from $400K to well into seven figures is reasonable.

What would be the upfront software licensing cost of a 3rd party APM solution?  Probably in a similar range, perhaps higher in certain cases.  Also, many APM companies offer lower annual SaaS subscription costs as an alternative to full upfront licensing payments – which add up to the same licensing fees over 3-5 years.  However, you should take into consideration that some solutions such as AppDynamics, which can be downloaded and installed via self-service within hours, provide immediate Time-to-Value versus waiting for a full software development lifecycle to occur for a custom built solution.

Advantage:  Cost = In-house (slight? depends on robustness of APM solution built), Time-to-Value = 3rd Party APM

2.  Ongoing, Annual Solution Costs

First, let’s determine the hardware & storage footprint required for the solution.  Typical in-house developed solutions architect for over-capacity as a rough estimate because of unknowns, and to avoid encountering limitations & performance issues.  A good estimate per environment (Dev, Test, Prod) may be 2 Large Servers and 16 TB of Storage for a starter in-house APM solution.  This cost might run in the $100K to $135K range per year.

For 3rd party APM solutions, the specs are well-known, validated, and published.  A leading APM solution like AppDynamics has been built and tuned via R&D by specialists over several years.  The footprint for a similar medium-sized deployment would be 1 Medium Server and 6 TB of Storage, for a rough cost of about $40-50K per year – or less than half of the in-house cost.

From an FTE support perspective for the in-house solution, you have to understand the administrative, support, & enhancement / new development labor required.  A good admin & support estimate would run about 1-2 FTEs, and new development might run 2 engineering FTEs to keep up with enhancement requests and coverage for new applications & technologies.  Remember, users will not expect the APM solution to stay static!  You might start with basic metric stores and time series data, but this will quickly run out of steam.  Next, you’ll want to build a baseline engine for the metric store based on load patterns and percentiles of metrics, as examples.  Demand for dashboarding and security access control requirements come into play, and require much design and testing work especially as the solution scales.  So this annual labor cost would run in the $375K plus range.

On top of that, in today’s Agile DevOps world, there are additional maintenance / revision labor costs each time a business application is released to production.  Appliances and/or monitoring agents need updating, and both application and business transaction topology maps likely need to be revised manually.  As the frequency of application release grows, often to a bi-weekly application release schedule, these are not insignificant tasks.  We estimate in a medium sized deployment, this could require about 2,000 labor hours per year to keep up, or about $100K.

In the AppDynamics APM world, these types of capabilities are already built into the solution.  So the maintenance per application release is zero since there is automated application discovery, mapping, and business transaction flows out of the box.  The ongoing FTE administrative & maintenance requirements for a medium-sized deployment are 1 FTE, or about $125K/year.  And new development is covered in the license costs via the hundreds of R&D professionals contributing to the various releases of the 3rd party software.

Advantage:  3rd Party APM (large, especially adding up multiple years)

3.  Solution Capabilities Driving Benefits

Next we look at the ability of an APM solution to provide benefits to your enterprise – which can be grouped into reducing costs, mitigating risks, and increasing or protecting revenue.  Two key performance metrics we suggest for measuring impact on cost, risk, and revenue are:

  1. # defects released to production

  2. Mean time to repair (MTTR) per performance issue

At AppDynamics, this is where we’ve invested our R&D dollars since 2008, and our industry-exceeding Net Promoter Score (NPS) of 84 – i.e., more than 8 in 10 customers would recommend us to a friend or colleague – is a testimony to our ability to achieve these benefits.

By leveraging AppDynamics in Pre-Production, our clients often report reduction in performance issues released to Production of 40%.  And by watching every line of code executed in Production, and measuring & scoring each transaction, we provide a “3 clicks to resolution” approach that often reduces MTTR per performance issue by 65% or more.  This is true of small application environments, as well as large deployments over 20,000 JVMs.

For an in-house solution, you have to assess what it would take to build similar APM capabilities to achieve these levels of defect and MTTR reduction.  How many years, developers, and dollars?  (And, as one client executive recently told us, “If I could do this, why wouldn’t my company be competing in the APM software space?!”)  Or alternatively and more likely, “let’s stitch something low-cost together” in-house.  Admittedly this sacrifices capability for cost cost, which translates into fewer features to address the MTTR and # of performance issue challenges you face.

For ballpark purposes, then, let’s credit the in-house solution in helping reduce both # of defects and MTTR up to 20%.  If we use an industry average cost per minute of slowness / downtime equating to $500 (inclusive of both labor and revenue protection factors), and there is one Sev1 performance issue per application per quarter – the difference between the in-house solution versus an APM solution would equate to over $1M per year for a medium sized deployment.

Advantage:  3rd Party APM (not close; and these add up year over year, too)

4.  Opportunity Costs

These costs deal chiefly with choosing what is most valuable for your developers to spend their time on.  Especially in today’s high-technology enterprises, there are excellent engineers capable of building fantastic tools across a wide range of areas – so it is tempting to initiate an in-house APM build project and get something out the door.  However, APM is not these engineers’ specialty and their talents are often better utilized on alternative software projects related to the core goods & services your company sells to your end-use customers that drive revenue.

This is an area we won’t attempt to quantify, as it’s more of a qualitative assessment and business decision specific to your organization.  But with a fairly mature and continually developing 3rd Party APM market, for most enterprises it’s logical to say….

Advantage:  3rd Party APM

4.  Summary

While the initial, upfront set of costs for an in-house vs. 3rd party APM solution purchase may be about the same (license vs build) – which leads some organizations to consider a “Do It Yourself” approach – there are significant ongoing annual costs for the care and feeding of an in-house APM solution compared to the 3rd party APM alternative.  These include the infrastructure footprint, as well as labor costs associated with administration, maintenance & enhancements.

The biggest differential in cost is typically related to the chief purpose of an APM solution – how often does it proactively reduce the number of production defects, and how fast does it help you resolve performance issues when they do inevitably occur?

Screen Shot 2014-06-19 at 8.47.02 AM

For a medium sized deployment, the total cost / benefit advantages of a 3rd party APM solution easily exceed $1M per year when compared to the in-house build alternative.  This benefit accumulates year over year.  And it’s worth mentioning here at AppDynamics, we achieve magnitudes of benefit even beyond other 3rd party APM solutions with lesser capabilities.  We’ve leveraged the feedback of our over 1,000 customers during the past several years to drive R&D and greater benefit realization.

For AppDynamics, these advantages stem from:

  1. The way our solution is architected to require minimal setup, upkeep and Time-to-Value, while providing ongoing Ease of Use.

  2. Key capabilities – such as transaction tracing across complex, distributed applications, in your data center and the cloud – which lead to significant improvement in KPIs such as # performance defects and MTTR.

  3. Our ability to intelligently scale to support the most complex and largest Pre-Production and Production environments.

  4. Thought-leadership expanding into our “Application Intelligence” platform with a host of new modules and capabilities.

So when assessing an in-house vs 3rd party APM solution, consider a multi-year TCO horizon and not just a short-term initial cost estimate.  Our personnel at AppDynamics standby to provide you help in not only getting a deep-dive on the APM market and our solution features, but also to analyze the value of APM choices via a detailed ROI assessment.

Thinking of trying a next generation APM solution rather than build it yourself? Try AppDynamics for free today!

The post Thinking About APM? 4 Key Considerations for Buy vs. Build Your Own written by appeared first on Application Performance Monitoring Blog from AppDynamics.

Read the original blog entry...

More Stories By Jyoti Bansal

In high-production environments where release cycles are measured in hours or minutes — not days or weeks — there's little room for mistakes and no room for confusion. Everyone has to understand what's happening, in real time, and have the means to do whatever is necessary to keep applications up and running optimally.

DevOps is a high-stakes world, but done well, it delivers the agility and performance to significantly impact business competitiveness.

@DevOpsSummit Stories
Providing the needed data for application development and testing is a huge headache for most organizations. The problems are often the same across companies - speed, quality, cost, and control. Provisioning data can take days or weeks, every time a refresh is required. Using dummy data leads to quality problems. Creating physical copies of large data sets and sending them to distributed teams of developers eats up expensive storage and bandwidth resources. And, all of these copies proliferating the organization can lead to inconsistent masking and exposure of sensitive data. But some organizations are adopting a new method of data management for DevOps that is delivering transformational business outcomes in faster time to market, lower costs, and great control. In his session at DevOps Summit, Brian Reagan, Managing Director of Blackthorne Consulting Group, an Actifio company, revi...
DevOps is often described as a combination of technology and culture. Without both, DevOps isn't complete. However, applying the culture to outdated technology is a recipe for disaster; as response times grow and connections between teams are delayed by technology, the culture will die. A Nutanix Enterprise Cloud has many benefits that provide the needed base for a true DevOps paradigm. In his Day 3 Keynote at 20th Cloud Expo, Chris Brown, a Solutions Marketing Manager at Nutanix, will explore the ways that Nutanix technologies empower teams to react faster than ever before and connect teams in ways that were either too complex or simply impossible with traditional infrastructures.
DevOps has often been described in terms of CAMS: Culture, Automation, Measuring, Sharing. While we’ve seen a lot of focus on the “A” and even on the “M”, there are very few examples of why the “C" is equally important in the DevOps equation. In her session at @DevOps Summit, Lori MacVittie, of F5 Networks, explored HTTP/1 and HTTP/2 along with Microservices to illustrate why a collaborative culture between Dev, Ops, and the Network is critical to ensuring success.
Interoute has announced the integration of its Global Cloud Infrastructure platform with Rancher Labs’ container management platform, Rancher. This approach enables enterprises to accelerate their digital transformation and infrastructure investments. Matthew Finnie, Interoute CTO commented “Enterprises developing and building apps in the cloud and those on a path to Digital Transformation need Digital ICT Infrastructure that allows them to build, test and deploy faster than ever before. The integration of Rancher software with Interoute Digital Platform gives developers access to a managed container platform that sits on a global privately networked cloud, enabling true distributed computing.”
SYS-CON Events announced today that SoftLayer, an IBM Company, has been named “Gold Sponsor” of SYS-CON's 18th Cloud Expo, which will take place on June 7-9, 2016, at the Javits Center in New York, New York. SoftLayer, an IBM Company, provides cloud infrastructure as a service from a growing number of data centers and network points of presence around the world. SoftLayer’s customers range from Web startups to global enterprises.
SYS-CON Events announced today that Conference Guru has been named “Media Sponsor” of SYS-CON's 20th International Cloud Expo, which will take place on June 6–8, 2017, at the Javits Center in New York City, NY. A valuable conference experience generates new contacts, sales leads, potential strategic partners and potential investors; helps gather competitive intelligence and even provides inspiration for new products and services. Conference Guru works with conference organizers to pass great deals to great conferences, helping you discover new conferences and increase your return on investment.
With major technology companies and startups seriously embracing Cloud strategies, now is the perfect time to attend @CloudExpo | @ThingsExpo, June 6-8, 2017, at the Javits Center in New York City, NY and October 31 - November 2, 2017, Santa Clara Convention Center, CA. Learn what is going on, contribute to the discussions, and ensure that your enterprise is on the right path to Digital Transformation.
SYS-CON Events announced today that CA Technologies has been named “Platinum Sponsor” of SYS-CON's 20th International Cloud Expo®, which will take place on June 6-8, 2017, at the Javits Center in New York City, NY, and the 21st International Cloud Expo®, which will take place October 31-November 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. CA Technologies helps customers succeed in a future where every business – from apparel to energy – is being rewritten by software. From planning to development to management to security, CA creates software that fuels transformation for companies in the application economy.
SYS-CON Events announced today that Loom Systems will exhibit at SYS-CON's 20th International Cloud Expo®, which will take place on June 6-8, 2017, at the Javits Center in New York City, NY. Founded in 2015, Loom Systems delivers an advanced AI solution to predict and prevent problems in the digital business. Loom stands alone in the industry as an AI analysis platform requiring no prior math knowledge from operators, leveraging the existing staff to succeed in the digital era. With offices in San Francisco and Tel Aviv, Loom Systems works with customers across industries around the world.
What if you could build a web application that could support true web-scale traffic without having to ever provision or manage a single server? Sounds magical, and it is! In his session at 20th Cloud Expo, Chris Munns, Senior Developer Advocate for Serverless Applications at Amazon Web Services, will show how to build a serverless website that scales automatically using services like AWS Lambda, Amazon API Gateway, and Amazon S3. We will review several frameworks that can help you build serverless applications, such as the AWS Serverless Application Model (AWS SAM), Chalice, and ClaudiaJS.
SYS-CON Events announced today that Interoute, owner-operator of one of Europe's largest networks and a global cloud services platform, has been named “Bronze Sponsor” of SYS-CON's 20th Cloud Expo, which will take place on June 6-8, 2017 at the Javits Center in New York, New York. Interoute is the owner-operator of one of Europe's largest networks and a global cloud services platform which encompasses 12 data centers, 14 virtual data centers and 31 colocation centers, with connections to 195 additional third-party data centers across Europe. Its full-service Unified ICT platform serves international enterprises and many of the world’s leading service providers, as well as governments and universities.
Have you ever noticed how some IT people seem to lead successful, rewarding, and satisfying lives and careers, while others struggle? IT author and speaker Don Crawley uncovered the five principles that successful IT people use to build satisfying lives and careers and he shares them in this fast-paced, thought-provoking webinar. You'll learn the importance of striking a balance with technical skills and people skills, challenge your pre-existing ideas about IT customer service, and gain new insights into how to build your own satisfying and rewarding career by rising above the ordinary and mundane to build an extraordinary life and career as a world-class Compassionate Geek.
SYS-CON Events announced today that T-Mobile will exhibit at SYS-CON's 20th International Cloud Expo®, which will take place on June 6-8, 2017, at the Javits Center in New York City, NY. As America's Un-carrier, T-Mobile US, Inc., is redefining the way consumers and businesses buy wireless services through leading product and service innovation. The Company's advanced nationwide 4G LTE network delivers outstanding wireless experiences to 67.4 million customers who are unwilling to compromise on quality and value.
SYS-CON Events announced today that Infranics will exhibit at SYS-CON's 20th International Cloud Expo®, which will take place on June 6-8, 2017, at the Javits Center in New York City, NY. Since 2000, Infranics has developed SysMaster Suite, which is required for the stable and efficient management of ICT infrastructure. The ICT management solution developed and provided by Infranics continues to add intelligence to the ICT infrastructure through the IMC (Infra Management Cycle) based on mathematical analysis and forecasting Big Data Analyze and Control.
SYS-CON Events announced today that SD Times | BZ Media has been named “Media Sponsor” of SYS-CON's 20th International Cloud Expo, which will take place on June 6–8, 2017, at the Javits Center in New York City, NY. BZ Media LLC is a high-tech media company that produces technical conferences and expositions, and publishes a magazine, newsletters and websites in the software development, SharePoint, mobile development and commercial UAV markets.
SYS-CON Events announced today that Cloudistics, an on-premises cloud computing company, has been named “Bronze Sponsor” of SYS-CON's 20th International Cloud Expo®, which will take place on June 6-8, 2017, at the Javits Center in New York City, NY. Cloudistics delivers a complete public cloud experience with composable on-premises infrastructures to medium and large enterprises. Its software-defined technology natively converges network, storage, compute, virtualization, and management into a single platform to drive unprecedented simplicity in the data center. Customers can start with a base infrastructure and scale to multi-site and multi-geo infrastructures with predictable economics and performance.
SYS-CON Events announced today that HTBase will exhibit at SYS-CON's 20th International Cloud Expo®, which will take place on June 6-8, 2017, at the Javits Center in New York City, NY. HTBase (Gartner 2016 Cool Vendor) delivers a Composable IT infrastructure solution architected for agility and increased efficiency. It turns compute, storage, and fabric into fluid pools of resources that are easily composed and re-composed to meet each application’s needs. With HTBase, companies can quickly provision resources and deploy unique, mission-critical, self-designed solutions to add-onto or create any type of infrastructure as per the business requirement. HTBase is the first company to enable a true multi-cloud strategy, enabling organizations to automate movement of data and workloads between private and public clouds. This means that organizations can now move data and workloads between pub...
While DevOps most critically and famously fosters collaboration, communication, and integration through cultural change, culture is more of an output than an input. In order to actively drive cultural evolution, organizations must make substantial organizational and process changes, and adopt new technologies, to encourage a DevOps culture. Moderated by Andi Mann, panelists discussed how to balance these three pillars of DevOps, where to focus attention (and resources), where organizations might slip up with the wrong focus, how to manage change and risk in all three areas, what is possible and what is not, where to start, and especially how new structures, processes, and technologies can help drive a new DevOps culture.
In their Live Hack” presentation at 17th Cloud Expo, Stephen Coty and Paul Fletcher, Chief Security Evangelists at Alert Logic, provided the audience with a chance to see a live demonstration of the common tools cyber attackers use to attack cloud and traditional IT systems. This “Live Hack” used open source attack tools that are free and available for download by anybody. Attendees learned where to find and how to operate these tools for the purpose of testing their own IT infrastructure. They also witnessed a cyber-attack from both sides – attacker and defender. An inside view of how indicators of compromise are researched to develop security content to be deployed for detection based on these attacks.
SYS-CON Events announced today that MobiDev, a client-oriented software development company, will exhibit at SYS-CON's 20th International Cloud Expo®, which will take place June 6-8, 2017, at the Javits Center in New York City, NY, and the 21st International Cloud Expo®, which will take place October 31-November 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. MobiDev is a software company that develops and delivers turn-key mobile apps, websites, web services, and complex software systems for startups and enterprises. Since 2009 it has grown from a small group of passionate engineers and business managers to a full-scale mobile software company with over 200 developers, designers, quality assurance engineers, project managers in house, specializing in the world-class mobile and web development.
Virtualization over the past years has become a key strategy for IT to acquire multi-tenancy, increase utilization, develop elasticity and improve security. And virtual machines (VMs) are quickly becoming a main vehicle for developing and deploying applications. The introduction of containers seems to be bringing another and perhaps overlapped solution for achieving the same above-mentioned benefits. Are a container and a virtual machine fundamentally the same or different? And how? Is one technically superior to the other? What about performance and security? Does IT need either one, or both?
DevOps is often described as a combination of technology and culture. Without both, DevOps isn't complete. However, applying the culture to outdated technology is a recipe for disaster; as response times grow and connections between teams are delayed by technology, the culture will die. A Nutanix Enterprise Cloud has many benefits that provide the needed base for a true DevOps paradigm.
Keeping pace with advancements in software delivery processes and tooling is taxing even for the most proficient organizations. Point tools, platforms, open source and the increasing adoption of private and public cloud services requires strong engineering rigor - all in the face of developer demands to use the tools of choice. As Agile has settled in as a mainstream practice, now DevOps has emerged as the next wave to improve software delivery speed and output. To make DevOps work, organizations must focus on what is most relevant to deliver value, reduce IT complexity, create more repeatable agile-based processes and leverage increasingly secure and stable, cloud-based infrastructure platforms.
ChatOps is an emerging topic that has led to the wide availability of integrations between group chat and various other tools/platforms. Currently, HipChat is an extremely powerful collaboration platform due to the various ChatOps integrations that are available. However, DevOps automation can involve orchestration and complex workflows. In his session at @DevOpsSummit at 20th Cloud Expo, Himanshu Chhetri, CTO at Addteq, will cover practical examples and use cases such as self-provisioning infrastructure/applications, self-remediation workflows, integrating monitoring and complimenting integrations between Atlassian tools and other top tools in the industry.
For organizations that have amassed large sums of software complexity, taking a microservices approach is the first step toward DevOps and continuous improvement / development. Integrating system-level analysis with microservices makes it easier to change and add functionality to applications at any time without the increase of risk. Before you start big transformation projects or a cloud migration, make sure these changes won’t take down your entire organization.