Skip to main content

Random text tool

I recently blogged about some of the tools I use, and how some are so useful I keep using them. As I mentioned, randomness is pretty useful, and I have tools to help me generate random text.

A few of my readers requested a copy of my simple random text generating script, so I've decided to open it up for everyone to use and test. It will have bugs, like all software, please send details and I'll try and fix them.

If you are interested in what UTF-8 is and what all that Unicode stuff is about, there is a great article by Joel Spolsky that explains all, and the wikipedia page is ok.

To use it...

First download the script, its on GitHub. The script is fairly short and is all in one file. You don't have to 'install it', its not a GEM.

Second, make sure you have Ruby version 1.9 or greater. You need version 1.9, because Ruby didn't handle UTF-8 well in older versions.

Thirdly run the script like this:

ruby fuzzutf8.rb

That will give you some usage examples.

Typically you might use it like this:

ruby fuzzutf8.rb -c 100 -utf8_two_byte
ծݸԒ̂ƍՔ͌вЊޏΒđµʠ£أҋ‘ŷݟߢ؁ˬУԝψĭ͕ѼХׅ۲ѨÃːׁǡ ܮ͂ϗՎœߔƭ̷Ȼ̧ʕʏٔǝ͈ܰȄبǚŎڿξަدʓ׽߷ЊݑڷũɛӥѤ̸ôߋȭԭæúѫܚʽͫʔƦܾ߷Ûƕͱ՛عӯ̨֢د߻Ӈ

That's a hundred code-points of UTF-8, in the two byte range.

As operating systems sometimes have issues copying and pasting these code-points (e.g. Windows XP) You may prefer to redirect the contents straight to a text file. e.g.:

ruby fuzzutf8.rb -c 100 -utf8_two_byte > random_utf8_file.txt

You can then treat the output like any other file. You can open it in a text editor, but beware many programs can not handle random UTF8 well, You may wish to save your work first.

The script also allows you to keep a record of the code-point numbers generated in a separate file ( -o option). These can then be replayed at a later date should you need to ( -r option ). The script doesn't do a lot of validation, so you can probably have great fun tripping the script itself up with dodgy inputs and arguments ;-)

I'm confident the script has many bugs, but it is nonetheless quite useful, and has been useful in helping me to locate other bugs. If you are interested in random text generators, There is a windows tool that I've also found useful, called Babel from TestMentor. Babel allows you to choose the output by character set/codepoint range, which is also interesting.

Comments

Popular posts from this blog

Can Gen-AI understand Payments?

When it comes to rolling out updates to large complex banking systems, things can get messy quickly. Of course, the holy grail is to have each subsystem work well independently and to do some form of Pact or contract testing – reducing the complex and painful integration work. But nonetheless – at some point you are going to need to see if the dog and the pony can do their show together – and its generally better to do that in a way that doesn’t make millions of pounds of transactions fail – in a highly public manner, in production.  (This post is based on my recent lightning talk at  PyData London ) For the last few years, I’ve worked in the world of high value, real time and cross border payments, And one of the sticking points in bank [software] integration is message generation. A lot of time is spent dreaming up and creating those messages, then maintaining what you have just built. The world of payments runs on messages, these days they are often XML messages – and they ...

What possible use could Gen AI be to me? (Part 1)

There’s a great scene in the Simpsons where the Monorail salesman comes to town and everyone (except Lisa of course) is quickly entranced by Monorail fever… He has an answer for every question and guess what? The Monorail will solve all the problems… somehow. The hype around Generative AI can seem a bit like that, and like Monorail-guy the sales-guy’s assure you Gen AI will solve all your problems - but can be pretty vague on the “how” part of the answer. So I’m going to provide a few short guides into how Generative (& other forms of AI) Artificial Intelligence can help you and your team. I’ll pitch the technical level differently for each one, and we’ll start with something fairly not technical: Custom Chatbots. ChatBots these days have evolved from the crude web sales tools of ten years ago, designed to hoover up leads for the sales team. They can now provide informative answers to questions based on documents or websites. If we take the most famous: Chat GPT 4. If we ignore the...

Micropython + LoRaWAN = PyLoRaWAN

I recently open sourced a simple Micropython library for LoRaWAN on the Raspberry Pi Pico.  (If you are interested, You can find it on GitHub .) If you are unsure what that all means, let me unpack it for you... Micropython is a slimmed down version of Python 3.x that works on microcontrollers like the Raspberry Pi Pico, and a host of other microcontroller boards .  LoRaWAN is a wireless communication standard that is ideal for long range, low power & low band width data transmission. Its based on a clever technique for making signals work well over distance, called LoRa. The library I've shared is a wrapper around the existing LoRaWAN support provided by the RAK Wireless 4200 board. The RAK4200  (affiliate link) essentially provides a modem, that can establish a connection to the network and relay messages. It uses the traditional AT command syntax (used by the modems of yore!) The Pico and RAK4200 Evaluation board (there is also a UPS under the Pico there - that's...