Skip to main content

Problem Management for Newbies! Part 1 of 2


Getting Started with Problem Management
To understand the process of Problem Management one must first understand that a problem is distinctively different than an Incident.  It is tracked and recorded separately, it requires a very different skill set and has a different objective than those that are required for “Incident Management”.  Problem records are unique entities and are reported upon separately.  A repeatable lean problem management process could very well be the glue that helps IT Service providers integrate and automate much of the work and effort required to “prevent” “Eliminate” and to “Minimize” the impact of incidents on your business and end user customers.
While an incident is an unplanned interruption that creates an impact to one or more business services, the problem is actually the cause of one or more incidents.   Example:  “I can’t access the ERP system”, “The web portal will not come up!”   “I can’t log in” are all examples of incidents.   The cause or the “problem” might be that the router on the network is down.  Therefore, you could have many incidents related to a single problem record.  It is important to note that an incident never becomes a problem.  The Service Desk Agent owns the incident and the problem management team or technician will work the problem to identify the root cause of these incidents and most importantly provide a solution to the Service Desk so that the incidents can be resolved.  The root cause and solution for the incident is logged in the Known Error Database (KEDB) that is owned by the Problem Management and shared with the service desk and other IT support staff. 
Reactive Problem management
Reactive Problem Management is a process that is primarily performed in support of Incident Management to ensure that the service desk has the solution to resolve the incident.  Problem Management will identify root cause of the incident.  in cases where resolving the cause might exceed the service level for the Mean Time To Restore Service (MTRS), Problem Management will attempt to provide a temporary solution to the service desk.  This enables the service desk agent to restore service as quickly as possible, meet the agreed targets and reduce the impact of the outage.  This temporary solution is referred to as a “Work Around”.   A very classic example might be something such as reboot the system.   The system is rebooted, the user is happy and running, Yay!  We met our SLA!  The resolution for a permanent solution might require an RFC.  This RFC would be submitted by problem management and might take two days, two weeks, or two months depending on the scope and complexity of this change to permanently fix the problem.  Although the “Incident” is closed in this example, the “Problem Record” will remain open until the resolution to the problem is completed.
If we the service provider follow industry best practice and track and record the incident record separately from the problem record our reports would show that we met the SLA!  Even though the problem could take days, or weeks to resolve, the problem record stays open until the permanent solution is implemented and recorded in the Known Error Database that is owned by the Problem Management Process.   We internally can generate management information on the “Problems” to determine the effort that was needed to resolve the problem, the cost to the business to resolve the problem, and also enable the service provider to demonstrate the value of IT to the business while building confidence of users and customers.   It is a beautiful thing!  If you like the value that this reactive side of “Problem Management” can bring to your business stay tuned for Part Two of “Problem Management for Newbies where the focus is on “Proactive Problem Management”!

 

Comments

Popular posts from this blog

The Four Ps of Service Design - It’s not all about Technology

People ask me why I think that many designs and projects often fail. The most common answer is from a lack of preparation and management. Many IT organizations just think about the technology (product) implementation and fail to understand the risks of not planning for the effective and efficient use of the four Ps: People, Process, Products (services, technology and tools) and Partners (suppliers, manufacturers and vendors). A holistic approach should be adopted for all Service Design aspects and areas to ensure consistency and integration within all activities and processes across the entire IT environment, providing end to end business-related functionality and quality. (SD 2.4.2) People:   Have to have proper skills and possess the necessary competencies in order to get involved in the provision of IT services. The right skills, the right knowledge, the right level of experience must be kept current and aligned to the business needs. Products:   These are the technology managem

What Is A Service Offering?

The ITIL4 Best Practice Guidance defines a “Service Offering” as a description of one or more services designed to address the needs of a target customer or group .   As a service provider, we can’t stop there!   We must know what the contracts of our service offering are and be able to put them into context as required by the customer.     Let’s explore the three elements that comprise a Service Offering. A “Service Offering” may include:     Goods, Access to Resources, and Service Actions Goods – When we think of “Goods” within a service offering these are the items where ownership is transferred to the consumer and the consumer takes responsibility for the future use of these goods.   Example of goods that are being provided in the offering – If this is a hotel service then toiletries or chocolates are yours to take with you.   You the consumer own these and they are yours to take with you.               Note: Goods may not always be provided for every Service Offe

What is the difference between Process Owner, Process Manager and Process Practitioner?

I was recently asked to clarify the roles of the Process Owner, Process Manager and Process Practitioner and wanted to share this with you. Roles and Responsibilities: Process Owner – this individual is “Accountable” for the process. They are the goto person and represent this process across the entire organization. They will ensure that the process is clearly defined, designed and documented. They will ensure that the process has a set of Policies for governance. Example: The process owner for Incident management will ensure that all of the activities to Identify, Record, Categorize, Investigate, … all the way to closing the incident are defined and documented with clearly defined roles, responsibilities, handoffs, and deliverables. An example of a policy in could be… “All Incidents must be logged”. Policies are rules that govern the process. Process Owner ensures that all Process activities, (what to do), Procedures (details on how to perform the activity) and the