Skip to main content

 Ever tried to get a teenager to do more chores around the home? For those without this joy in their lives, I’ll let you in on a secret - it goes down like a bucket of sick.

You can sometimes cajole them, sometimes bribe them and even threaten them (We’ll take away your laptop!) But at best, this has mixed results. You’ll often get an uptick in throughput - the rubbish & recycling will exit the apartment more often. But quality will suffer, the cacophony of bins being banged, incessant grumbling and milk cartons being scattered about will lead to you questioning many of your life choices.


This is often the case in life generally and software development in particular. The old adage “Faster, better, cheaper - Pick any 2” still holds true. Interestingly this isn’t always bad news for those in the business. More often it's a problem for customers, be they other teams or actual customers

For example, if you provide the means to produce faster - more people may be buying your tool. So in software development this might be a product like Claude Code, Cursor or Github Copilot. Making initial ‘coding’ quicker.

If we assume the business needs to maintain a certain level of quality to maintain market share or avoid prosecution & regulator scrutiny, then that increased ‘flow’ of features will need to be tested and fixed. Again a tool could help with this, but we need to be aware of the common method problem. The same approach used to create the code (with certain problems and biases) will likely test the code with the same biases and problem-blindness.

The same disgruntled teenager that can now “do” more will be less able to see the poor job they’ve done. Mind you, ChatGPT is always so much more polite than the average teenager. Maybe an AI dominated world would at least be more polite.

So ‘raising’ good software is becoming a more important job. Questioning whether the slick new app is reliable, accurate and performant will become a more involved, more challenging and more interesting job. As observers have noted, advances, many of them valuable, have a jagged edge. There are frequent gaps in the quality of applications and systems at the very time more people are able to produce more of them.

So when your bank replaces that old system that pays your mortgage every month, with a new application heavily built using AI assisted coding tools, also hope your bank hires skilled test engineers to investigate how well the systems actually work.


Comments

Popular posts from this blog

Don't be a Vogon, make it easy to access your test data!

 The beginning of the hitch-hikers guide to the galaxy leads with an alien ship about to destroy the Earth, and the aliens saying we (mankind) should have been more prepared – as a notice had been on display quite clearly – on Alpha Centauri the nearby star system, for 50 years. Seriously, people - what are you moaning about – get with the program?  The book then continues with the theme of bureaucratic rigidity and shallow interpretations of limited data. E.g. The titular guide’s description of the entire Earth is one word: “Harmless”, but after extensive review the new edition will state: “Mostly harmless”. Arthur Dent argues with the Vogons about poor data access This rings true for many software testing work, especially those with externally developed software, be that external to the team or external to the company. The same approaches that teams use to develop their locally developed usually don’t work well. This leads to a large suite of shallow tests that are usually h...

Can 'reasoning' LLMs help with recs data creation?

  A nervous tourist, glances back and forth between their phone and the street sign. They then rotate their phone 180 degrees, pauses, blink and frown. The lost traveller, flags a nearby ‘local’ (the passer-by has a dog on a lead.   “Excuse me…” she squeaks, “How may I get to Tower Hill?” “Well, that’ s a good one” ponders the dog walker, “You know…” “Yes?” queries the tourist hopefully. “Yeah…” A long pause ensues then, “Well I wouldn’t start from here” He states confidently. The tourist almost visibly deflates and starts looking for an exit. That’s often how we start off in software testing. Despite the flood of methodologies, tips on pairing, power of three-ing, backlog grooming, automating, refining and all the other … ings ) We often find ourselves having to figure out and therefore ‘test’ a piece of software by us ing it. And that’s good. Its powerful, and effective if done right. But, like our dog walker, we can sometimes find ourselves somewhere unfamiliar...

What possible use could Gen AI be to me? (Part 1)

There’s a great scene in the Simpsons where the Monorail salesman comes to town and everyone (except Lisa of course) is quickly entranced by Monorail fever… He has an answer for every question and guess what? The Monorail will solve all the problems… somehow. The hype around Generative AI can seem a bit like that, and like Monorail-guy the sales-guy’s assure you Gen AI will solve all your problems - but can be pretty vague on the “how” part of the answer. So I’m going to provide a few short guides into how Generative (& other forms of AI) Artificial Intelligence can help you and your team. I’ll pitch the technical level differently for each one, and we’ll start with something fairly not technical: Custom Chatbots. ChatBots these days have evolved from the crude web sales tools of ten years ago, designed to hoover up leads for the sales team. They can now provide informative answers to questions based on documents or websites. If we take the most famous: Chat GPT 4. If we ignore the...