Skip to main content

How-abouts and What-ifs


Happenstance. Ideally, I wouldn't have been reading two books at once. But I'd already started the one when the Dev manager chucked the other onto my desk as he stalked past, wild-eyed from some meeting, en route to the kitchen for a caffeine salve to the throbbing vein in his forehead. Intrigued (with the book not the vein, I've seen that plenty of times now), I started flicking through it, got hooked and then alternated between it and my own over the next few days, seeing the connections that both made to the, ah, idea that ideas (or lack of them) can be a problem.

The Dev manager's book, Fooled by Randomness by Nassim Nicholas Taleb, talks about consequences of an insufficient grasp of potential outcomes. Taleb's domain is financial markets and the instruments that populate them and he attributes the false confidence of many traders - and the population at large, by generalisation - to a lack of understanding of first possible scenarios and second the ability to make sensible estimates of their probabilities. Oh and, third, a flawed sense of their own prowess based on a misunderstanding of the extent to which their past performance was down to them or to chance.


In the the other book, Lateral Thinking by Edward de Bono, the proposal is that logical or vertical reasoning can only take you so far, that there are classes of solution that are unlikely or impossible to find by starting from a known state and simply reasoning. Lateral thinking, for de Bono, is a way to generate what-ifs and how-abouts rather than following existing lines of thought. He makes connections between lateral thinking, creativity, insight and humour, observing that a cornerstone of all is the ability to recast a set of circumstances. For example, humour frequently comes when pulling the rug from under a set of assumptions, exposing a different way of viewing a scenario. He asserts that the operation of the mind favours reinforcements of existing thought patterns and - perhaps less intuitively - that those patterns are significantly influenced by the order that information was first encountered. (Taleb has a similar concept of path dependence.)


Coincidence. Only that week I'd been talking to a couple of members of my team about trade-offs between (a) time spent thinking about a problem, researching it, exploring possible lines of attack, sketching out potential consequences, identifying commonalities and differences amongst the various approaches and (b) thinking of a plausible solution, diving in and just doing it. The knowledge gained from the former has the potential to significantly improve whatever action is ultimately taken. But it might also turn out to be worthless and still have consumed your budget. You might win big choosing the first idea, if it just works, or you might end up with a compromise when, late, you realise there's something significant you missed that you feel you could've and should've thought of. Barriers to trying the former can include functional fixedness, the open-ended nature of it, managers putting pressure on, fear of burning through the budget for a project without producing anything and the fact that JFDI can produce something good enough.

Taleb talks about the downsides of a narrow vision, saying that because of the relative infrequency of paradigm-shifting events - the kinds of thing a superficial or non-existent analysis would miss - and the large numbers of traders, many traders with little or no skill can do well by following a trend and/or having good fortune. When an unexpected event - a Black Swan, as in a later book of his  - does appear, a proportion will just happen not to be affected, will appear prescient, think what they did was the obvious thing, maybe have unwarranted strategic smarts attributed to them, will themselves reason out an explanation for their success - attribution and hindsight biases - and perhaps even pursue what they think they have been doing more aggressively afterwards.

To contextualise Taleb's example: assume that in any given year there's a 50% chance of a journeyman tester not being found out after missing some potentially serious bug. If we start with 1000 testers then, even after ten years the chances are that a handful will still have been fortunate enough not to have had adverse consequences (1000 in year 1, 500 in year 2, 250, 125, 64, 32, 16, 8, 4, 2, 1).

As a tester, as someone who invests time trying to balance the cost of applying effort, and where, against the risk of not applying it, it is humbling and worrying to think that the results of my work may be as much, or even more, down to luck as to anything I do or did. I may miss rare issues, glaring issues, trivial and severe issues but if they happen not to be encountered by someone who matters or at a time that matters or with an effect that matters then I may still be thought of as a success and I may still think of myself as a success and continue on in whatever approach I've been using. The one which assists me in missing those issues. A tester could go through a whole career making poor decisions but never seeing failure. You could be interviewing that person, with a stellar CV, right now, for that test manager position in your organisation.

So what is Taleb's prescription? First, it's selfish; to take advantage of it by invoking a strategy that aims to win big when the rare event occurs but at worst lose small at other times. More generally he suggests being aware of the human proclivity for being fooled means that you can at least take account of it. Further, being able to envisage situations other than the status quo will enable the probabilities of such events to be considered. Next, he counsels that:
Maximing the probability of winning does not necessarily maximise the expectation from the game when one loss is catastrophic
He's talking here about the fact that probability is insufficient on its own for evaluating risk. It has to be associated with some measure of cost in order to provide an expectation. For instance, suppose that as a trader the probability of losing £10000 on any given day is 1/1000. The cost is extreme, but the probability is small (to a human mind thinking along standard paths). Let's say that the probability of making £1000 is 4/1000 and of making £1 is 9995/1000. So in the course of 1000 days, or around three years, our expectation is a significant loss although almost every day shows some profit. If the trader never thinks of the 1/1000 event the expectation over the period would be an apparently safe, if modest, profit.

9995 x  1          £9995
4    x  1000       £4000
1    x -100000  -£100000
----             ------
1000             -£86005
       
Ideas are the currency of creativity. The fuel of furtherance. The driver of disruptive actions. Without ideas there's no route off the beaten track barring accident, and a strategy that relies on timely accidents for its sole source of innovation is setting its user up to fail at some point. Of course, Taleb is not the originator of expectation and de Bono is not alone in thinking about ways to provoke ideas (and he's moved things along since this particular 1970s work too) but, regardless of the sources, these notions can be useful to testers at both micro and macro levels.  Brian Eno famously has his Oblique Strategies cards, for example, and  various testers have suggested using similar schemes to assist with test idea generation.

Over the years I've come to value up-front exploration (including physical and thought experiments, proof-of-concepts, prototyping) to build a model of the problem space before implementation. Frequently, even where the budget for a project is tiny, I'll try to isolate some time, however small, for it. In particular, I've learned the hard way that my first idea for the solution to a problem is usually wrapped up in my notion about how I'd implement it given what I know about other implementations. You can get out of that mindset by consciously disentangling the what from the how, but you can step back further to deliberately make time to consider the what-ifs and how-abouts and it's here particularly that lateral thinking can help.

For de Bono, lateral thinking is a skill that you can choose to apply. His book is all about techniques for the generation of ideas, about disrupting thought patterns, the provocation of thinking outside of the norm, the understood. For him, by generating more suggestions you have a wider pool of starting points to consider. Some of them may, on evaluation, be clearly absurd or impractical, but may cause you to think of something else which isn't or spur someone else on to another chain of ideas which don't have the same flaws.

Logical reason and lateral leaps can be interleaved in any way, there is no need to stick exclusively to one approach; feedback from one round of work can and should influence the next round. He is enough of a realist to talk about stopping heuristics for a round of deliberate lateral thought (it's not rocket science: use time boxes, create a certain number of ideas ...).  Although the analogy is not complete, there's some parallel with the view that Rapid Software Testing has of the potential interplay of exploratory and scripted testing.

 

If you bring in the notion of expectation too (where the cost could be the risk of failure), you have some basic machinery for generating and comparing possibilities, for rudimentary prioritisation.

One application for lateral thinking that seems to get relatively little attention is that of identifying alternative solutions to something that already works. Frequently, we're approaching tasks from the point of view of an identified problem: test this new implementation, fix that abhorrent behaviour, improve the performance of that operation and so on. Letting yourself look at something extant and considered sufficient with a lateral eye can be productive (if not necessarily always popular).

On the other hand, lateral thinking may be less immediately applicable when investigating failures. This is often a place where reasoning from the known or observed is the best starting point. Looking at logs, inspecting customer reports and so on will often provide enough evidence for traditional logical reasoning to narrow down the problem. When you're involved in a live support call with a customer, suggestions from way left-field may not be appropriate, at least not until the obvious options have been exhausted.

Synchronicty. I came across the de Bono book while browsing the Pelican section of the Book Barn near Bristol (that day I also picked up Crosby's Quality is Free and a box of other stuff including Julie Burchill's caustic Love it or Shove it  all at a quid a pop!). I wasn't looking for it, but reading it put me in a position to make some new (to me) connections. Quite apart from the actions you take when confronted with a problem, that exposure to the ideas and experiences of others is another valuable way to increase your chances of enumerating possibilities. You never know when they'll come in, when the now will remind you of the then and spur that thought, that crucial what-if or how-about.

And so some time this week as I pace rapidly down the office, fresh from some meeting or other, on my way to walk briskly round the block, the vein in my temple pulsing and plum-coloured, I'll slap The Complete Plain Words onto the Dev manager's desk,  muttering "what if I just told them to ..." as I disappear round the corner.
Images: http://flic.kr/p/ca1gm, Amazonhttp://flic.kr/p/4qCTgp, RST v3.1.3

Comments

  1. Note that this example: "will themselves reason out an explanation for their success - so-called hindsight bias" - is actually called attribution bias.

    Hindsight bias is important as a trait possessed by decision makes who think that the event has been predictable and successfully predicted by some people with "more skill." Forecasters suffering from hindsight bias would be irrelevant if no one listened to them.

    ReplyDelete
  2. Cheers Peter.

    In the slightly larger context they were also post facto attributed with foresight ("will appear prescient") but you're right that that phrase could do with tightening. I'll make an edit.

    ReplyDelete

Post a Comment

Popular posts from this blog

Can Code, Can't Code, Is Useful

The Association for Software Testing is crowd-sourcing a book,  Navigating the World as a Context-Driven Tester , which aims to provide  responses to common questions and statements about testing from a  context-driven perspective . It's being edited by  Lee Hawkins  who is  posing questions on  Twitter ,   LinkedIn , Mastodon , Slack , and the AST  mailing list  and then collating the replies, focusing on practice over theory. I've decided to  contribute  by answering briefly, and without a lot of editing or crafting, by imagining that I'm speaking to someone in software development who's acting in good faith, cares about their work and mine, but doesn't have much visibility of what testing can be. Perhaps you'd like to join me?   --00-- "If testers can’t code, they’re of no use to us" My first reaction is to wonder what you expect from your testers. I am immediately interested in your working context and the way

Meet Me Halfway?

  The Association for Software Testing is crowd-sourcing a book,  Navigating the World as a Context-Driven Tester , which aims to provide  responses to common questions and statements about testing from a  context-driven perspective . It's being edited by  Lee Hawkins  who is  posing questions on  Twitter ,   LinkedIn , Mastodon , Slack , and the AST  mailing list  and then collating the replies, focusing on practice over theory. I've decided to  contribute  by answering briefly, and without a lot of editing or crafting, by imagining that I'm speaking to someone in software development who's acting in good faith, cares about their work and mine, but doesn't have much visibility of what testing can be. Perhaps you'd like to join me?   --00-- "Stop answering my questions with questions." Sure, I can do that. In return, please stop asking me questions so open to interpretation that any answer would be almost meaningless and certa

Testing (AI) is Testing

Last November I gave a talk, Random Exploration of a Chatbot API , at the BCS Testing, Diversity, AI Conference .  It was a nice surprise afterwards to be offered a book from their catalogue and I chose Artificial Intelligence and Software Testing by Rex Black, James Davenport, Joanna Olszewska, Jeremias Rößler, Adam Leon Smith, and Jonathon Wright.  This week, on a couple of train journeys around East Anglia, I read it and made sketchnotes. As someone not deeply into this field, but who has been experimenting with AI as a testing tool at work, I found the landscape view provided by the book interesting, particularly the lists: of challenges in testing AI, of approaches to testing AI, and of quality aspects to consider when evaluating AI.  Despite the hype around the area right now there's much that any competent tester will be familiar with, and skills that translate directly. Where there's likely to be novelty is in the technology, and the technical domain, and the effect of

Testers are Gate-Crashers

  The Association for Software Testing is crowd-sourcing a book,  Navigating the World as a Context-Driven Tester , which aims to provide  responses to common questions and statements about testing from a  context-driven perspective . It's being edited by  Lee Hawkins  who is  posing questions on  Twitter ,   LinkedIn , Mastodon , Slack , and the AST  mailing list  and then collating the replies, focusing on practice over theory. I've decided to  contribute  by answering briefly, and without a lot of editing or crafting, by imagining that I'm speaking to someone in software development who's acting in good faith, cares about their work and mine, but doesn't have much visibility of what testing can be. Perhaps you'd like to join me?   --00-- "Testers are the gatekeepers of quality" Instinctively I don't like the sound of that, but I wonder what you mean by it. Perhaps one or more of these? Testers set the quality sta

Postman Curlections

My team has been building a new service over the last few months. Until recently all the data it needs has been ingested at startup and our focus has been on the logic that processes the data, architecture, and infrastructure. This week we introduced a couple of new endpoints that enable the creation (through an HTTP POST) and update (PUT) of the fundamental data type (we call it a definition ) that the service operates on. I picked up the task of smoke testing the first implementations. I started out by asking the system under test to show me what it can do by using Postman to submit requests and inspecting the results. It was the kinds of things you'd imagine, including: submit some definitions (of various structure, size, intent, name, identifiers, etc) resubmit the same definitions (identical, sharing keys, with variations, etc) retrieve the submitted definitions (using whatever endpoints exist to show some view of them) compare definitions I submitted fro

Build Quality

  The Association for Software Testing is crowd-sourcing a book,  Navigating the World as a Context-Driven Tester , which aims to provide  responses to common questions and statements about testing from a  context-driven perspective . It's being edited by  Lee Hawkins  who is  posing questions on  Twitter ,   LinkedIn , Mastodon , Slack , and the AST  mailing list  and then collating the replies, focusing on practice over theory. I've decided to  contribute  by answering briefly, and without a lot of editing or crafting, by imagining that I'm speaking to someone in software development who's acting in good faith, cares about their work and mine, but doesn't have much visibility of what testing can be. Perhaps you'd like to join me?   --00-- "When the build is green, the product is of sufficient quality to release" An interesting take, and one I wouldn't agree with in general. That surprises you? Well, ho

Make, Fix, and Test

A few weeks ago, in A Good Tester is All Over the Place , Joep Schuurkes described a model of testing work based on three axes: do testing yourself or support testing by others be embedded in a team or be part of a separate team do your job or improve the system It resonated with me and the other testers I shared it with at work, and it resurfaced in my mind while I was reflecting on some of the tasks I've picked up recently and what they have involved, at least in the way I've chosen to address them. Here's three examples: Documentation Generation We have an internal tool that generates documentation in Confluence by extracting and combining images and text from a handful of sources. Although useful, it ran very slowly or not at all so one of the developers performed major surgery on it. Up to that point, I had never taken much interest in the tool and I could have safely ignored this piece of work too because it would have been tested by

Am I Wrong?

I happened across Exploratory Testing: Why Is It Not Ideal for Agile Projects? by Vitaly Prus this week and I was triggered. But why? I took a few minutes to think that through. Partly, I guess, I feel directly challenged. I work on an agile project (by the definition in the article) and I would say that I use exclusively exploratory testing. Naturally, I like to think I'm doing a good job. Am I wrong? After calming down, and re-reading the article a couple of times, I don't think so. 😸 From the start, even the title makes me tense. The ideal solution is a perfect solution, the best solution. My context-driven instincts are reluctant to accept the premise, and I wonder what the author thinks is an ideal solution for an agile project, or any project. I notice also that I slid so easily from "an approach is not ideal" into "I am not doing a good job" and, in retrospect, that makes me smile. It doesn't do any harm to be reminded that your cognitive bias

Test Now

The Association for Software Testing is crowd-sourcing a book,  Navigating the World as a Context-Driven Tester , which aims to provide  responses to common questions and statements about testing from a  context-driven perspective . It's being edited by  Lee Hawkins  who is  posing questions on  Twitter ,   LinkedIn , Mastodon , Slack , and the AST  mailing list  and then collating the replies, focusing on practice over theory. I've decided to  contribute  by answering briefly, and without a lot of editing or crafting, by imagining that I'm speaking to someone in software development who's acting in good faith, cares about their work and mine, but doesn't have much visibility of what testing can be. Perhaps you'd like to join me?   --00-- "When is the best time to test?" Twenty posts in , I hope you're not expecting an answer without nuance? You are? Well, I'll do my best. For me, the best time to test is when there

Play to Play

I'm reading Rick Rubin's The Creative Act: A Way of Being . It's spiritual without being religious, simultaneously vague and specific, and unerring positive about the power and ubiquity of creativity.  We artists — and we are all artists he says — can boost our creativity by being open and welcoming to knowledge and experiences and layering them with past knowledge and experiences to create new knowledge and experiences.  If that sounds a little New Age to you, well it does to me too, yet also fits with how I think about how I work. This is in part due to that vagueness, in part due to the human tendency to pattern-match, and in part because it's true. I'm only about a quarter of the way through the book but already I am making connections to things that I think and that I have thought in the past. For example, in some ways it resembles essay-format Oblique Strategy cards and I wrote about the potential value of them to testers 12 years ago. This week I found the f