Erhalten Sie Zugang zu diesem und mehr als 300000 Büchern ab EUR 5,99 monatlich.
Today's programmers don't develop software systems from scratch. instead, they spend their time fixing, extending, modifying, and enhancing existing software. Legacy systems often turn into an unwieldy mess that becomes increasingly difficult to modify, and with architecture that continually accumulates technical debt. Carola Lilienthal has analyzed more than 300 software systems written in Java, C#, C++, PHP, ABAP, and TypeScript and, together with her teams, has successfully refactored them. This book condenses her experience with monolithic systems, architectural and design patterns, layered architectures, domain-driven design, and microservices. With more than 200 color images from real-world systems, good and sub-optimal sample solutions are presented in a comprehensible and thorough way, while recommendations and suggestions based on practical projects allow the reader to directly apply the author's knowledge to their daily work. "Throughout the book, Dr. Lilienthal has provided sound advice on diagnosing, understanding, disentangling, and ultimately preventing the issues that make software systems brittle and subject to breakage. In addition to the technical examples that you'd expect in a book on software architecture, she takes the time to dive into the behavioral and human aspects that impact sustainability and, in my experience, are inextricably linked to the health of a codebase. She also expertly zooms out, exploring architecture concepts such as domains and layers, and then zooms in to the class level where your typical developer works day-to-day. This holistic approach is crucial for implementing long-lasting change." From the Foreword of Andrea Goulet CEO, Corgibytes, Founder, Legacy Code Rocks
Sie lesen das E-Book in den Legimi-Apps auf:
Seitenzahl: 360
Das E-Book (TTS) können Sie hören im Abo „Legimi Premium” in Legimi-Apps auf:
Carola Lilienthal is managing director of WPS – Workplace Solutions GmbH and is responsible for the department of software architecture. Since 2003, Dr. Carola Lilienthal has been analyzing over 300 architectures in Java, TypeScript, C #, C ++, ABAP and PHP, and advising development teams on how to improve the sustainability of their software systems. She is particularly interested in the education of software architects, which is why she regularly passes on her knowledge at conferences, in articles and training courses.
Carola Lilienthal
Analyze and Reduce Technical Debt
Carola Lilienthal
Editor: Michael Barabas / Christa Preisendanz
Copyeditor: Jeremy Cloot
Layout and type: Josef Hegele
Cover design: Helmut Kraus, www.exclam.de
Bibliographic information published by the Deutsche Nationalbibliothek (DNB)
The Deutsche Nationalbibliothek lists this publication in the Deutsche Nationalbibliografie; detailed bibliographic data can be found on the internet at http://dnb.dnb.de
Copyright © 2019 dpunkt.verlag GmbH
Wieblinger Weg 17
69123 Heidelberg
ISBN:
Print (dt.)978-3-86490-673-2
Print (engl.) 978-1-68198-569-5
PDF978-3-96088-780-5
epub978-3-96088-781-2
mobi978-3-96088-782-9
Title of the German original: Langlebige Software-Architekturen.
Technische Schulden analysieren, begrenzen und abbauen
2., überarbeitete und erweiterte Auflage 2017
ISBN 978-3-86490-494-3
Distributed in the UK and Europe by Publishers Group UK and dpunkt.verlag GmbH.
Distributed in the U.S. and all other territories by Ingram Publisher.
Many of the designations in this book used by manufacturers and sellers to distinguish their products are claimed as trademarks of their respective companies. Where those designations appear in this book, and dpunkt.verlag was aware of a trademark claim, the designations have been printed in caps or initial caps. They are used in editorial fashion only and for the benefit of such companies, they are not intended to convey endorsement or other affiliation with this book. No part of the material protected by this copyright notice may be reproduced or utilized in any form, electronic or mechanical, including photocopying, recording, or by any information storage and retrieval system, without written permission of the copyright owner. While reasonable care has been exercised in the preparation of this book, the publisher and author assume no responsibility for errors or omissions, or for damages resulting from the use of the information contained herein.
The first time I met Dr. Carola Lilienthal was at a conference on Domain-Driven Design where we were both presenting. As she went through her slides and discussed many of the concepts in this book, two things became clear.
First, her understanding of software architecture is simultaneously deep and broad. The advice she gives comes from decades of hands-on experience working with hundreds of legacy systems. She works in the trenches, far removed from the ivory tower that so many software architects move into. Yet, technical as she is, her expertise transcends a single specialty. Her stories, examples, and advice apply to a wide range of domains, languages, and technical stacks. While it might be rare to find an individual with equal breadth and depth, it’s exactly the skill set that’s required for a book on making software architecture sustainable.
The second thing I noticed was the way she lit up when she talked. For her, making software systems more modular, extensible, and sustainable isn’t simply an intellectual or academic pursuit. It’s a passion. The joy, hope, and optimism she exudes when talking about improving software architecture is contagious. While this might seem trivial on the surface, make no mistake—that mentality is a crucial asset when you’re wading through the mire and chaos of a system bogged down with technical debt. So many systems are abandoned because their maintainers lacked not just the skill to make a change, but the hope that a change was even possible.
For these reasons, I’m thrilled that Dr. Lilienthal has decided to share her experience with the world through her book, Sustainable Software Architecture. Legacy code is notoriously difficult to work with for a variety of reasons, not the least of which is the amount of interdependency that exists within a system. Throughout the book, Dr. Lilienthal has provided sound advice on diagnosing, understanding, disentangling, and ultimately preventing the issues that make software systems brittle and subject to breakage. In addition to the technical examples that you’d expect in a book on software architecture, she takes the time to dive into the behavioral and human aspects that impact sustainability and, in my experience, are inextricably linked to the health of a codebase. She also expertly zooms out, exploring architecture concepts such as domains and layers, and then zooms in to the class level where your typical developer works day-to-day. This holistic approach is crucial for implementing long-lasting change.
There is an immense amount of gratification to be found in making a system sustainable. Too often in our society, “makers” (people who get the most joy during the initial phases of a project) are the ones who enjoy all the glory. However, it’s important to recognize the tremendous value that already resides within existing software systems, along with the value of “menders” like Dr. Lilienthal who are eager to dive in and make these systems better. Legacy code runs our world. While it might be tempting to bulldoze a system and start over, doing so, especially if done in haste, rarely accomplishes the goal that we ultimately want—software systems that are easier to modify, extend, grow, and maintain. The better path, the path that ultimately uses fewer resources, causes less frustration, and gets better results, is the path of committed sustainability, and taking the first step down that path is just a few page turns away.
Andrea Goulet
CEO, Corgibytes
Founder, Legacy Code Rocks
Dear Reader,
Welcome to this book on sustainable software architecture. In the following chapters I invite you to explore the inner life of software systems with me and admire the beauty and cruelty that can be found within.
In recent years, I have been fortunate enough to be allowed to look deeply into many software systems. In doing so, I have pondered and discussed with many architects, debating which structures are more sustainable than others and why. In this book you will find many recommendations on all aspects of developing sustainable systems. There are also real-world case studies, screenshots of real systems to help my observations come to life, and a little theory too, to help you quickly grasp the principles involved and retain what you learn.
I would be delighted if you could give me feedback on my experiences. Perhaps you have seen similar things, or perhaps you have a completely different view of software architecture. I hope you will find the following pages interesting, informative, discussion-worthy, and perhaps even amusing.
I would like to thank everyone who accompanied me on the journey that led to the creation of this book. Thanks go to my family who always support me in all my projects, even with an idea as crazy as writing a book. You are wonderful! Merci beaucoup!
Thanks go to everyone who reviewed the German version of the text: Eberhard Wolff, Gernot Starke, Johannes Rost, Stefan Sarstedt, Stefan Tilkov, Stefan Zörner, Tobias Zepter, Ulf Fildebrandt, and my anonymous reviewers. Your comments have made me think, rethink, and then think even further—thank you!
Thanks to all my colleagues at WPS Workplace Solution for the many discussions about software architecture, without which this book could never have been written: Holger Breitling, Kai Bühner, Guido Gryczan, Stefan Hofer, Bettina Koch, Jörn Koch, Michael Kowalczyk, Kai Rüstmann, Arne Scharping, Lasse Schneider, Henning Schwentner, and Heinz Züllighoven.
Thanks also to everyone who covered for me so that I could write in peace, especially Martina Bracht-Kopp, Inge Fontaine, Petra Gramß, Silja Heitmann, and Doris Nied.
Many thanks to Norman Wenzel, Heinrich Rust, and Alexander von Zitzewitz who worked with me for such a long time on the architectural review. It’s a privilege to work with you.
My thanks also go to everyone at my publisher. You have all been so kind and constructive while this book was taking shape.
And, of course, many thanks to all the customers who allowed me to tell you about their systems—you have made a really valuable contribution to this book. And finally, many thanks to my colleague Ben McDougall, who corrects my English texts with a lot of patience and humor.
Hamburg, June 2019
Carola Lilienthal
@cairolali
www.sustainable-software-architecture.com
1Introduction
1.1Software Architecture
1.2Sustainability
1.3Technical Debt
1.3.1No Knowledge of Software Architecture
1.3.2Complexity and Size
1.3.3Architectural Erosion Takes Place Unnoticed
1.3.4We Don’t Pay Extra For Quality!
1.3.5Types of Technical Debt
1.4The Systems I Have Seen
1.5Who Is This Book For?
1.6How To Use This Book?
2Tracking Down Technical Debt
2.1Building Block Terminology
2.2Target and Actual Architecture
2.3Improvements to a Running System
2.4False Positives and Generated Code
2.5Cheat Sheet for Sotograph
3Architecture in Programming Languages
3.1Java Systems
3.2C# Systems
3.3C++ Systems
3.4ABAP Systems
3.5PHP Systems
4Architecture Analysis and Improvement
4.1Developers and Architects
4.2Working on Architecture is a “Holschuld”
4.3Live Architecture Improvement Workshop
4.4Dealing with Mothers and Fathers
4.5Modularity Maturity Index (MMI)
4.6Technical Debt in the Lifecycle
5Cognitive Psychology and Architectural Principles
5.1Modularity
5.1.1Chunking
5.1.2Transfer to Design Principles
5.1.2.1Units
5.1.2.2Public Interface
5.1.2.3Coupling
5.2Pattern Consistency
5.2.1Establishing Schemata
5.2.2Transfer to Design Principles
5.3Hierarchy
5.3.1Formation of Hierarchies
5.3.2Transfer to Design Principles
5.5Consequences for Architectural Analysis
6Architectural Styles that Reduce Technical Debt
6.1Rules of Architectural Styles
6.2Separation of Business and Technical Building Blocks
6.3Layered Architecture
6.3.1Technical Layering
6.3.2Domain Layering
6.3.3The Infrastructure Layer
6.3.4Integration of Domain-oriented Layers
6.4Hexagonal, Onion, and Clean Architecture
6.5Microservices and Domain-Driven Design
6.6Pattern Languages
6.6.1The Tool & Material Pattern Language
6.6.2The DDD Pattern Language
6.6.3Typical Framework Patterns
6.7Sustainability and Architectural Styles
7Pattern in Software Architecture
7.1Mapping the Target to the Actual Architecture
7.2The Ideal Structure: Domain-Oriented or Technical?
7.3Public Interfaces for Building Block
7.4Interfaces: The Architectural Miracle Cure?
7.4.1Basic Therapy
7.4.2Side Effects
7.4.3Field Studies on “Living Patients”
7.4.4Fighting the Monolith
7.5The Need for Microservices
8Pattern Languages: A True Architectural Treasure!
8.1The Treasure Hunt
8.2Software Archaeology
8.3From the Treasure Chest
8.4How Much Gold Is There?
8.5Annual Growth Rings
8.6Unclear Patterns Provoke Cycles
9Chaos Within Layers: The Daily Pain
9.1Evaluating the Mess
9.1.1The Extent of the Chaos
9.1.1.1Architectural Styles and Cycles
9.1.1.2Lines of Code in Cycles
9.1.1.3Dependency Injection and Cycles
9.1.2Scope and Interconnectedness
9.1.3Cycle Range Within an Architecture
9.2The Big Ball of Mud
9.2.1The Black Hole Effect
9.2.2Breaking Free
9.2.3“Weaponizing” Technical Layering
9.2.4Pattern Language as a Lighthouse
9.3Uneven Modules
10Refining Modularity
10.1Building Block Cohesion
10.2Building Block Sizes
10.3Class Sizes
10.4Method Size and Complexity
10.5Loose Coupling
10.6Coupling and Class Size
10.7How Modular Are You?
11Real-World Case Studies
11.1The Java System Alpha
11.2The C# System Gamma
11.3The C++ System Beta
11.4The Java System Delta
11.5The Java System Epsilon (with C# Satellites)
11.5.1Java-Epsilon
11.5.2C#-Epsilon 1
11.5.3C#-Epsilon 2
11.6The ABAP system Lambda
12Conclusion: The Path Toward Sustainable Architecture
Appendix
AAnalysis Tools
A.1Lattix
A.2Sonargraph Architect
A.3Sotograph and SotoArc
A.4Structure101
References
Index
Software systems are certainly among the most complex constructions that human beings have ever conceived and built, so it’s not surprising that some software projects fail, and legacy systems often remain unmodified for fear they will simply stop working. In spite of this complexity, I still encounter project teams that are in control of their software systems, regardless of their industry, technology stack, size, or age. Adding functionality and fixing bugs in such legacy systems involves much less effort than I would have imagined, and new employees can be trained with reasonable effort. What do these project teams do differently? “How do they manage their software so effectively in the long run?”
Sustainability of software architectures
The main reasons for long-term success or failure in software development and maintenance can be found on many different levels. These include the industry, the technology used, the quality of the software system, and the qualifications of the users and developers. This book focuses on the sustainability of software architecture. I will show you which factors are most important for maintaining and expanding a software architecture over many years without making significant changes to your staffing, budget, or delivery schedule.
50 definitions
Computer science has not been able to commit itself to a single definition of software architecture. In fact, there are more than 50 different definitions, each highlighting specific aspects of architecture. In this book we will stick to two of the most prominent definitions:
Definition #1:
“Software architecture is the structure of a software product. This includes elements, the externally visible properties of the elements, and the relationships between the elements.” [Bass et al. 2012]
Architecture Views
This definition deliberately talks about elements and relationships in very general terms. These two basic materials can be used to describe a wide variety of architecture views. The static (module) view contains the following elements: classes, packages, namespaces, directories, and projects—in other words, all the containers you can use for programming code in that particular programming language. In the distribution view, the following elements can be found: archives (JARs, WARs, assemblies), computers, processes, communication protocols and channels, and so on. In the dynamic (runtime) view we are interested in the runtime objects and their interactions. In this book we will deal with the structures in the module (static)1 view and show why some are more durable than others.
The second definition is one that is very close to my heart. It doesn’t define architecture by way of its structure, but rather the decisions made.
Definition #2:
Structure vs. decisions
These two definitions are very different. The first defines what the structure of a software system consists of on an abstract level, whereas the second refers to decisions that the developers or architects make regarding the system as a whole. The second definition defines the space for all overarching aspects of architecture, such as technology selection, architectural style selection, integration, security, performance, and much, much more. These aspects are just as important to an architecture as the chosen structure, but are not the subject of this book.
Decisions create guardrails
This book deals with the decisions that influence the structure of a software system. Once a development team and its architects decide on the structure of a system, they have defined guardrails for the architecture.
Guardrails for your architecture
Create an architecture that restricts the design space during the development of the software system and gives you direction in your work.
Guardrails for development
Guardrails allow developers and architects to orient themselves. The decisions are all channeled in a uniform direction and can be understood and traced, giving the software system a homogeneous structure. When solving maintenance tasks, guardrails guide all participants in a uniform direction, and lead to faster and more consistent results during adaptation or extension of the system.
This book will answer questions regarding which guardrails lead to durable architectures and extend the life of a software system.
Short-lived software
Software that is only used for a short period of time shouldn’t have an architecture that is designed for sustainability. An example of such a piece of software is a program that migrates data from a legacy system into the database of a new application. This software is used once and then hopefully discarded. We say “hopefully” because experience has shown that program parts that are no longer used can be still found in many software systems. They are not discarded because the developers assume that they might need them again later. Also, to delete lines of working code that were created with a lot of effort isn’t done lightly. There is hardly a developer or architect who likes to do this.2
The “Year 2000” problem
Most of the software we program today lives much longer than expected. It is often edited and adapted. In many cases software is used for many more years than anyone could have imagined at the coding stage. Think, for example, of the Cobol developers who wrote the first major Cobol systems for banks and insurance companies in the 1960s and 1970s. Storage space was expensive at the time, so programmers thought hard about preserving storage space for every field saved on the database. For the Cobol developers at the time, it seemed a reasonable decision to implement years as two-digit fields only. Nobody imagined back then that these Cobol programs would still exist in the year 2000. During the years prior to the turn of the millennium, a lot of effort had to be made to convert all the old programs to four-digit year fields. If the Cobol developers in the 1960s and 1970s had known that their software would be in service for such a long time, they would have used four-digit fields to represent years.
Our software will get old
Such a long lifetime is still realistic for a large number of the software systems that we build today. Many companies shy away from investing in new development, which generates significant costs that are often higher than planned. The outcome of new developments is also unknown, and users too have to be taken into consideration. In addition, the organization is slowed down during the development process and an investment backlog arises for urgently needed extensions. At the end of the day, it is better to stick with the software you have and expand it if necessary. Perhaps a new front end on top of an old server will suffice.
Old and cheap?
This book is rooted in the expectation that an investment in software should pay for itself for as long as possible. New software should incur the lowest possible maintenance and expansion costs in the course of its lifetime—in other words, the technical debt must be kept as low as possible.
The term “technical debt” is a metaphor coined by Ward Cunningham in 1992 [Cunningham 1992]. Technical debt arises when false or suboptimal technical decisions are made, whether consciously or unconsciously. Such decisions lead to additional effort at a later point in time, which delays maintenance and expansion.
Good intentions
If there are capable developers and architects on the team at the beginning of a software development project, they will contribute their best experience and accumulated design know-how to creating a long-lasting architecture with no technical debt. However, this goal cannot be ticked off at the beginning of the project according to the principle “First we will design a long-lasting architecture and everything else will be fine from then on.”
In truth, you can only achieve a long-lasting architecture if you constantly keep an eye on technical debt. In figure 1-1 we see what happens when technical debt grows over time in comparison to what happens when it is reduced regularly.
Figure 1-1 Technical debt and architectural erosion
A quality-oriented team
Imagine a team that is continuously developing a system using releases or iterations: if the team focuses on quality it will knowingly pile on new technical debt with each add-on (the yellow arrows in fig. 1-1). During the development of an expansion, the team will already be thinking about what needs to be done to improve the architecture. Meanwhile (or after the expansion), technical debt will be reduced again (indicated by the green arrows in fig. 1-1). A constant sequence of expansion and improvement occurs. If the team works this way, the system remains in a corridor of low technical debt with a predictable maintenance effort (see green bracket in fig. 1-1).
A chaotic team
If the team doesn’t aim to constantly preserve the architecture, the architecture of the system is slowly lost, and maintainability deteriorates. Sooner or later, the software system leaves the corridor of minor technical debt (indicated by the ascending red arrows in fig. 1-1).
Architectural erosion
The architecture erodes further and further. Maintenance and the extension of the software become increasingly expensive until you reach the point where every change becomes a painful effort. In figure 1-1 this case is made clear by the red arrows becoming shorter and shorter. As the erosion of the architecture increases, less and less functionality can be implemented, fewer bugs can be fixed, and fewer adaptations to other quality requirements can be achieved per unit of time. The development team becomes frustrated and demotivated and sends desperate signals to the project’s management. Such warnings are usually registered far too late.
Too expensive!
If you are on the path depicted by the ascending red arrows, the sustainability of your software system will continuously decrease. The software system becomes error-prone, the development team gets a reputation for being sluggish, and changes that used to be possible within two person-days now take up to twice or three times as long. All in all, everything happens much too slowly. In the IT industry, “slow” is a synonym for “too expensive”. That’s right, technical debt has accumulated and with every change you have to pay interest on the technical debt principal plus the cost of the expansion.
Returning to good quality
The way out of this technical debt dilemma is to retroactively improve architectural quality. As a result, the system can be pulled back into the corridor of low technical debt step by step (see the red descending arrows in fig. 1-1). This path requires significant resources (both time and money) but still represents a reasonable investment in the future. After all, future maintenance will involve less effort and will be cheaper. For software systems that once had good architecture, this procedure usually leads to rapid success.
The CRAP cycle
The situation is completely different if the corridor of high technical debt is reached and the maintenance effort becomes disproportionately high and unpredictable (see the red bracket in fig. 1-1). I am frequently asked what “disproportionately high maintenance” means. A general answer that is valid for all projects is of course difficult to provide. However, in various systems with good architecture I have noticed that for every 500,000 lines of code (LOC), one or two full-time developers are required for maintenance. In other words, 40-80 hours per week per 500,000 LOC is a good starting point for determining the time needed to fix bugs and make small adjustments. If new functionality is to be integrated into the system, you will of course require even more capacity.
When I visit a company to evaluate an architecture, the first question I ask is about the size of the system(s). Secondly, I ask about the size and efficiency of the development department. If the answer is, “We employ 30 developers for our Java system of 3 million LOC, but they are all busy with maintenance and we can hardly get any new features implemented …” I immediately assume that it is an indebted system. Naturally, setting such an expectation is harsh, but it has usually proved helpful as an initial hunch.
If the system has too much debt to be maintainable and extensible, companies often decide to replace the system with a new one (see the colored circle in fig. 1-1). In 2015, to my great delight, Peter Vogel described the typical lifecycle of a system with technical debt as a “CRAP cycle”. The acronym CRAP stands for C(reate) R(epair) A(bandon) (re)P(lace)3. If repairing a system seems fruitless or too expensive, the system is left to die and eventually replaced.
However, this last step should be approached cautiously. As early as the beginning of the 2000s, many legacy systems written in COBOL and PL/1 were declared unmaintainable and replaced by Java systems. Everything was supposed to get better with this new programming language, and this promise was made to the managers who were tasked with funding the new implementation. Today, a number of these eagerly built Java systems are full of technical debt and generate immense maintenance costs.
Causes of technical debt
In the course of my professional life to date, I have repeatedly encountered four major causes of technical debt:
No knowledge of software architecture
Complexity and size of software systems
Architectural erosion arises unnoticed
A lack of understanding of custom software development processes on the part of managers and customers
These four factors usually occur in combination and often influence each other.
Programming ≠ Software Architecture
When a development team starts a new project, I always try to include an experienced developer-architect in the team. Every developer can program, but knowledge of sustainable software architecture only comes with experience.
Unusable software
If nobody in the team cares about sustainable architecture, the resulting system is likely to be maintenance-intensive. The architecture of these systems evolves over a period without any planning. Each developer fulfills her own personal ideas on architecture and/or design for her part of the software. “It’s a legacy system!” is often heard in the latter case.
Starting with technical debt
In this case, technical debt is accumulated right from the beginning of and increases continuously. The usual attitude to such software systems is that they somehow grew up under a bad influence. Systems like this can often no longer be maintained after a relatively short period of time. I have even seen systems that have become unmaintainable after just three years.
Significant refactoring
The architectural and design ideas of architects and developers must initially be questioned and their quality standardized in order to move these systems closer to the corridor of low technical debt using whatever possible means. Overall, this is much more complex than getting a system with previously good architecture back on track. However, large-scale quality refactoring can be broken down into manageable sub-steps. After some initial small improvements (quick wins) the quality gain becomes noticeable through faster maintenance. Such quality-improving work often costs less than a new implementation, even if many development teams understandably enjoy new development projects much more. This positive attitude to a new development project is often accompanied by underestimation of the complexity of the task.
The complexity of a software system is fed by two different sources: the use case for which the software system was built and the solution (the program code, the database, and so on).
An appropriate solution for the problem must be found within its specialized domain—a solution that allows the user to carry out the planned business processes using the software system. These factors are known as “problem-inherent” and “solution-dependent” complexity. The greater the complexity of the problem, the greater the solution-dependent complexity will be4.
Problem-inherent complexity
This correlation is the reason why cost predictions and software development duration are often estimated too low. The actual complexity of the problem cannot be determined at the beginning of the project, so the complexity of the solution is underestimated many times over5.
This is where agile methods apply. Agile methods only estimate the functionality that is to be implemented up to the end of each iteration. The complexity of the problem and the resulting complexity of the solution are rechecked time and again.
Solution-dependent complexity
Not only the complexity inherent to the problem is difficult to determine. The solution, too, contributes to the complexity. Depending on the experience and methodical strength of the developers, the design and implementation of a problem will vary in complexity. Ideally, a solution will only be as complex as the problem. In this case, we can say that it is a good solution.
Essential or accidental?
If the solution is more complex than the actual problem, the solution is not a good one and a corresponding redesign is necessary. The difference between better and worse is called the essentialand accidental complexity. Table 1-1 summarizes the relationship between these four complexity terms.
Table 1-1Complexity
Essential
Accidental
Problem-inherent
Complexity of the domain
Misunderstandings about the domain
Solution-dependent
Complexity of technology and architecture
Misunderstandings about technology
Superfluous solution elements
Essential complexity is the kind of complexity that is inherent in the nature of a project. When analyzing the domain, developers try to identify the essential complexity of the problem. The essential complexity inherent to a domain leads to a correspondingly complex solution and can never be resolved or avoided just by using a particularly good design. The essential complexity of the problem has thus become the essential complexity of the solution.
In contrast, the term “accidental complexity” is used to refer to the elements of complexity that are not necessary and can therefore be eliminated or reduced. Accidental complexity can arise from misunderstandings during analysis of the domain as well as during implementation by the development team.
If no simple solution is found during development due to incomprehension or lack of an overview, the software system is already unnecessarily complex. Examples of unnecessary complexity are multiple implementations, integration of unneeded functionality, and disregard of software design principles. However, developers sometimes risk additional accidental complexity if, for example, they want to try out new but unnecessary technology during development.
Software is complex
Even if a team manages to incorporate only essential complexity into its software, the immense number of elements involved makes software difficult to master. In my experience, an intelligent developer can retain an overview of about 30,000 lines of code and anticipate the effects of code changes in the other places. Software systems in productive use today tend to be considerably larger than this. We are more likely talking about a range of 200,000 to 100 million lines of code.
Architecture reduces complexity
All these arguments make it clear that developers require software architecture that gives them the greatest possible overview. Only then can they navigate their way around the existing complexity. If developers have an overview of the architecture, the probability of appropriate software changes being made increases. When they make changes, they can take all of the affected areas into account and leave the functionality of the unaltered lines of code untouched. Of course, additional techniques are very helpful, such as automated testing, high test coverage, architectural education/training, and a supportive project and enterprise organization.
A drawn-out process
Even with a capable development team, architectural erosion occurs unnoticed. How does this happen? Well, it’s often a long, drawn-out process. During implementation, developers increasingly deviate from the architecture. In some cases, they do this consciously because the planned architecture does not meet the increasingly evident requirements. The complexity of the problem and the solution were underestimated and demands changed within the architecture, but there is no time to consistently follow these changes through for the entire system. In other cases, time and cost issues arise and must be solved so quickly that there is no time to develop a suitable design and rethink the architecture. Some developers are not even aware of the planned architecture, so they unintentionally violate it. For example, relationships are built between components that disregard prescribed public interfaces or run contrary to the modularization and layering of the software system. By the time you notice this creeping decay, it is high time to intervene!
Symptoms of severe architectural erosion
Once you have reached the nadir of architectural erosion, every change becomes unbearable. No-one wants to continue working on such a system. In his article Design Principles and Design Pattern, Robert C. Martin summed up these symptoms of a rotten system [Martin 2000]:
Rigidity
Rigidity:
The system is inflexible to modification. Each modification leads to a cascade of further adjustments in dependent modules. Developers are often unaware of what is happening in the system and are uncomfortable with changes. What starts as a small adjustment or a small refactoring leads to an ever-increasing marathon of repairs in ever more modules. The developers chase the effects of their modifications in the source code and hope to have reached the end of the chain with every new realization.
Fragility
Fragility:
Changes to the system result in errors that have no obvious relationship to the modifications made. Each adjustment increases the probability of new subsequent errors in surprising locations. The fear of modification grows, and the impression is that the developers are no longer in control of the software.
Immobility
Immobility:
There are design and construction units that already solve a similar task as the one that is currently being implemented. However, these solutions cannot be reused because there is too much “baggage” surrounding the unit in question. A generic implementation or separation is also not possible because reconstructing the old units would be too complex and error-prone. Usually the required code is copied, as this requires less effort.
Viscosity
Viscosity:
If developers need to make an adjustment, there are usually several options. Some of these options preserve the design, while others break it. If such “hacks” are easier to implement than the design-preserving solution, the system is described as viscous.
Development teams must constantly fight these symptoms to keep their systems durable and make customizing and maintenance fun in the long run. If only the costs didn’t exist …
Architecture costs extra money
Many customers are surprised when their service providers—either external or in-house—tell them that they need more money to improve the architecture and thus the quality of the software system. Customers often say things like, “It already works! What do I gain if I spend money on quality?” or, “You got the contract because you promised you would deliver good quality. You can’t demand more money for quality now”. These are very unpleasant situations. As software developers and architects, our goal is to write software with sustainable architecture and high quality. At this point, it is not easy to explain that an evolving architecture is an investment in the future and saves money in the long run.
These situations often arise because the customer/management doesn’t realize (or doesn’t want to know) that custom software development is an unplannable process. If new, unprecedented software is developed, the essential complexity is difficult to master. The software itself, its use, and its integration into the context of work organization and changing business processes are unpredictable. Possible extensions or new forms of use cannot be foreseen. These are essential characteristics of custom software development!
Software ≠ industrially produced goods
Today, every software system is custom-developed software and integration into the customer’s IT landscape is different every time. The technological and economic developments are so rapid that a software architecture and the resulting system that represents the ideal solution for today will reach its limits by tomorrow. These constraints lead to the conclusion that software is not an industrially manufactured product. Instead, it is a custom solution that makes sense at a given point in time, with an architecture that will hopefully endure for a long time but that must continue to evolve. This includes both functional and non-functional aspects, such as internal and external quality.
Fortunately, increasing numbers of customers are beginning to understand the terms “technical debt” and “sustainability.
Many types of technical debt and their variants are mentioned in discussions about technical debt. Four of these are relevant to this book:
Code Smell
Implementation debt:
The source code contains “code smells”, such as long methods, code duplicates, and similar.
Structural Smell
Design and architecture debt:
The design of classes, packages, subsystems, layers, and modules is inconsistent or complex and does not fit the planned architecture.
Unit Tests
Test debt:
Tests are missing or only the positive case is tested. The test coverage with automated unit tests is low.
Documentation
Documentation debt:
There is no documentation, or the documentation that exists is incomplete or outdated. The overview of the architecture is not supported by the documents. Design decisions are not documented.
Basic requirement: low test debt
Most of the hints, suggestions, and good and bad examples you will find in this book relate to the first two types of technical debt. You will see how such debt arises and how it can be reduced. However, this debt can only be reduced safely if the level of test debt is low or is reduced while you work. In this respect, low test debt is a basic requirement. On the one hand, documentation of the architecture is a good basis for the architectural analysis and improvements dealt with in this book (i.e., low documentation debt helps with the analysis). On the other hand, architectural analysis also produces documentation for the analyzed system, thus also reducing documentation debt.
After completing my computer science studies in 1995 I worked as a software developer, before moving on to software architecture, project management, and consultancy. Since 2002 I have often been invited to examine the quality of software systems. In the beginning I was only able to look at the source code, but this aspect has been tool-supported since 2004. Architectural analysis and improvement developed with thorough, tool-assisted checks of the source code according to certain criteria. In Chapters 2 and 4 you will see how analysis and improvement take place technically and organizationally.
Sizes and languages
In the course of time, I have reviewed systems written in Java (130), C++ (30), C# (70), ABAP (5), PHP (20), and PLSQL (10). TypeScript and JavaScript will soon be added to this list. Each of these programming languages has its own peculiarities, as we will see in Chapter 3. The size of a system (see fig. 1-2) also influences how the software architecture is (or should be) designed.
Figure 1-2 Sizes of analyzed systems in lines of code (LOC)
Lines of code
The “lines of code” (LOC) specification in figure 1-2 includes the lines of executable code, blank lines, and comments. If you exclude comments and blank lines, you need to subtract an average of 50 per cent of the LOC. Typically, the ratio between executable and non-executable code is between 40 and 60 per cent, depending on whether the development team placed begin and end markers ({}) on separate lines. The sizes of systems analyzed in this book are quoted for code written in Java/C#, C++, and PHP. These languages all have a similar “sentence length”. ABAP programming is much more wordy and generates two or three times as much source code.
All these analysis have sharpened and deepened my understanding of software architecture and my expectations of how software systems should be built.
Programming
This book is written for architects and developers who work with source code on a daily basis. They will benefit most from this book because it points out potential problems in large and small systems as well as offering solutions.
Improving existing systems
Consultants with development experience, practicing architects, and development teams who want to methodically improve existing software solutions will find many references to large and small improvements in this book.
Learning for the future
Inexperienced developers will probably have trouble understanding the content in some places due to the complexity of the issues involved. However, they will still be able to learn the basics of how to build sustainable software architectures.
The book consists of twelve chapters, some of which follow on from one another, but that can also be read separately.
Main contents
Figure 1-3 shows a schematic of the book’s chapters. The two white chapters provide the basic framework of introduction and summary. The turquoise chapters are the theoretical parts of the book, and the dark blue chapter deals with organizational aspects. The light blue chapters contain many small and large practical examples.
Different Paths
Ideally, you will read the entire book from start to finish. However, if your time is limited, I recommend you read Chapters 1 and 2 first to give you a good foundation. You can then skip to Chapters 5 and/or 6 followed by Chapters 7, 8, 9, or 10. You can also jump directly from Chapter 2 to Chapters 4 or 8, and dive right into the procedure of architectural analysis or the case studies.
Figure 1-3 Structure of the book
Chapter 1 lays the foundation for understanding sustainable architectures and technical debt. Chapter 2 shows how to find and reduce technical debt in existing architectures. Chapter 3 explains the specialties of programming languages in architectural analysis. Chapter 4 explains which roles in architectural analysis and improvement have to work together to achieve a valuable result, and how architectures can be compared using the Modularity Maturity Index (MMI). Chapter 5 deals with the question of how large structures must be designed so that people can quickly navigate their way around them. Cognitive psychology gives us clues as to which specifications lead to architectures that can be quickly grasped and understood. Chapter 6 presents the architectural styles commonly used today. With their rules, they provide guardrails for software architectures. Chapters 7, 8, 9, and 10 describe the findings from various practical, real-world analysis and consultations. Chapter 11 contains seven exemplary (anonymized) case studies that I find particularly interesting. To conclude, Chapter 12 contains a brief summary of how architects, development teams, and management should proceed to improve the quality of their architecture. The appendix presents a range of analysis tools that I have enjoyed using in the course of my everyday work.
The analysis of technical debt is a broad field. It can be a matter of questioning the architecture of a system that had previously been planned on paper, or you can work out quality assurance goals for an architecture with the customer and fine-tune them using scenarios as detailed in the ATAM1 architecture analysis method. However, this book takes a different approach, analyzing the source code of multiple systems in a search for sustainability and technical debt.
The best way to build sustainable software is covered in Chapter 5