I’ve always admired the translations of Chinese poetry – I’m no expert in the field, but there are two poets, Du Fu and Li Bai, whom I really like. They were legendary masters from the Great Tang Dynasty, and, if the translations are accurate, they had a phenomenal talent for freezing a moment and capturing that particular slice of time with their words; their poems read like a string of Polaroids stretched across a riverbank.
Here, for example, is a Du Fu poem. Among other things, there’s a certain simplicity here: one strong emotion resonates through, and unlike much of the English verse I grew up with, it’s firmly in the present tense:
A LONG CLIMB
In a sharp gale from the wide sky apes are whimpering,
Birds are flying homeward over the clear lake and white sand,
Leaves are dropping down like the spray of a waterfall,
While I watch the long river always rolling on.
I have come three thousand miles away. Sad now with autumn
And with my hundred years of woe, I climb this height alone.
Ill fortune has laid a bitter frost on my temples,
Heart-ache and weariness are a thick dust in my wine.
Which I suppose is why this appeals to me – there’s a rare clarity here, even if the translation might be inaccurate.
So the Tang poets seemed like the right place to start for my experiment with machine-generated art (and besides, the excellent GWERN already did the usual English). Right now, I’ve snuck away for a few hours from my statistical models to peek at the code I set to run this morning.
Among those of us who work with machine learning, the work I’ve put into this whole project is trivial: a tiny dataset, a cup of coffee, a few lines of Python code, and a single cigarette while I waited for OpenAI’s transformer-based generation model to download.
The OpenAI model, like most neural networks, can be thought of as a rough analogue of a human brain – a collection of artificial neurons strung in layers, lighting up as fragments of thoughts (inputs) pass between one layer and the next. We teach it by pointing it at something and telling it to produce something similar. The boffins at OpenAI have decreed that their full model is too complex and human-like to release – God knows what people might do with it in this age of fake news – and so they’ve banged the drum and let out only a tiny, child version of their beast, the GPT2 117M.
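To make the "neurons strung in layers" picture concrete, here is a toy sketch in plain Python. It is not GPT-2: the real model is a transformer with attention and learned weights, whereas this uses random weights and a simple tanh activation, and every name in it (`make_layer`, `forward`, the layer sizes) is invented for illustration. It only shows activations being passed from one layer to the next.

```python
import math
import random


def make_layer(n_inputs, n_neurons, rng):
    """Build one layer: each neuron is (weights, bias), with one weight per input."""
    return [
        ([rng.uniform(-1.0, 1.0) for _ in range(n_inputs)], rng.uniform(-1.0, 1.0))
        for _ in range(n_neurons)
    ]


def forward(layers, inputs):
    """Feed the inputs through each layer in turn and return the final activations."""
    activations = list(inputs)
    for layer in layers:
        # Each neuron sums its weighted inputs, adds a bias, and "lights up"
        # to a value between -1 and 1 via tanh.
        activations = [
            math.tanh(sum(w * a for w, a in zip(weights, activations)) + bias)
            for weights, bias in layer
        ]
    return activations


rng = random.Random(0)  # fixed seed so the toy is repeatable
network = [make_layer(3, 4, rng), make_layer(4, 2, rng)]  # 3 inputs -> 4 -> 2
output = forward(network, [0.5, -0.2, 0.1])
print(output)
```

Training, in this picture, means nudging those weights until the outputs look more like the examples the network was pointed at.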
No matter. We shall use the child. I’ve set it to train on a collection of Tang poetry, and given my personal biases, both Li Bai and Du Fu feature prominently in here. It has digested and spat out some poetry at me, and I, acting in a role much like a subeditor at a newspaper, am going over copy turned in by the new journalist.
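For the curious, the data preparation for a run like this might look something like the sketch below. It is an assumption, not a record of the actual script: the helper name `build_corpus` and the placeholder poem are hypothetical. The one real detail is `<|endoftext|>`, the token GPT-2 uses to mark the boundary between documents, which lets the model learn where one poem stops and the next begins.

```python
END_OF_TEXT = "<|endoftext|>"  # GPT-2's document separator token


def build_corpus(poems):
    """Join (title, lines) pairs into one training string, one poem per document."""
    chunks = []
    for title, lines in poems:
        # Drop blank lines and stray whitespace; keep one verse line per text line.
        body = "\n".join(line.strip() for line in lines if line.strip())
        chunks.append(f"{title.strip().upper()}\n{body}")
    return f"\n{END_OF_TEXT}\n".join(chunks) + "\n"


# Sample input: the opening of the Du Fu poem quoted earlier, plus a
# hypothetical placeholder standing in for the rest of the collection.
poems = [
    ("A Long Climb", [
        "In a sharp gale from the wide sky apes are whimpering,",
        "Birds are flying homeward over the clear lake and white sand,",
    ]),
    ("Placeholder Poem", ["first line", "", "second line"]),
]

corpus = build_corpus(poems)
print(corpus)
```

The resulting string would then be written to a text file and handed to whichever fine-tuning script is in use.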
Here are our first results: six poems handpicked out of ten efforts. My edits have been to delete no more than two lines per poem.
ENDLESS YEARNING I
I am endlessly yearning
To be in Baok Sha.
…Insects hum of autumn by the gold brim of the well;
A thin frost glistens like little mirrors on my cold mat;
The high lantern flickers; and. deeper grows my longing.
I lift the shade and, with many a sigh, gaze upon the moon,
Single as a flower, centred from the clouds.
Above, I see the blueness and deepness of sky.
Below, I see the greenness and the restlessness of water….
Heaven is high, earth wide; bitter between them flies my sorrow.
Can I dream through the gateway, over the mountain?
Breaks my heart.
ENDLESS YEARNING II
The sun has set, and a mist is in the flowers;
And the moon grows very white and people sad and sleepless.
Would that it might follow the spring wind to Yanran Mountain.
Since nobody can bear to you the burden of my song,
I think of you far away, beyond the blue sky,
And my eyes that once were sparkling
Are now a well of tears.
… Oh, if ever you should doubt this aching of my heart,
Here in my bright mirror, come back and look at me!
This isn’t the best poetry I’ve seen, but it isn’t the worst. Parents, I understand, try to get children to write poetry all the time – mine certainly did – but it generally takes many years before the little bundle of joy stops shitting their diapers and decides to take on Robert Frost. I’ve spent maybe three hours of my time on this so far, and most of that was spent sorting out code issues.
THE HARD ROAD
I would cross the Yellow River, but ice chokes the ferry;
I would climb the Taihang Mountains, but the sky is blind with snow….
I would sit and poise a fishing-pole, lazy by a brook —
But I suddenly dream of riding a boat, sailing for the sun….
Journeying is hard,
There are many turnings —
Which am I to follow?….
I will mount a long wind some day, and break the heavy waves
And set my cloudy sail straight and bridge the deep, deep sea.
DOWN ZHONGNAN MOUNTAIN
Down the blue mountain in Feng district
You have found your home.
The wind is beating at us, beating at our ears,
And we see only the dark clouds;
We hear only the low wind rustling grasses
Under the quiet river;
And the farmers all are returning what they have,
Washing their fields and burning them.
The GPT2-117M model seems, to my untrained eyes, to have picked up the ‘form’ of Tang poetry quite efficiently. Some rote phrases are inevitable given how small this dataset is, but I’m surprised at how few there are. With a few careful cuts – a line pruned here and there – I can bring out the impression of one overarching emotion. I’m particularly proud of this:
THE LAMENT OF THE ATTACKING EMPEROR
Soldiers are sent north to guard the City of Silk
And east to receive the rain from the Spears of Heaven.
The south runs its wall, the stars are rising,
And our footprints are three hundred miles away.
How can I bear to sweep them away?
A vanished river is forgotten by the people….
Who knows if it is still alive?
… Who knows if it ever was?
Those last two lines, I have to stress, are most definitely not mine.
There’s a man I keep coming back to whenever I see something like this, and that’s the chessmaster Garry Kasparov. Kasparov is possibly the greatest human chess player we have seen to date; from 1986 to 2005 he was the world’s best at the game.
In 1997, Kasparov was defeated by a machine – IBM’s Deep Blue. The move changed chess history, and I think – looking back – that’s where the “human vs machine” fear really hit home. Ever since then, chessmasters – the human kind – have accepted getting thrashed by machines.
What did Kasparov do? Kasparov went away and made computer-aided chess work. He took human vs machine and made it human + machine. His thesis is in the title of his TED Talk – “Don’t fear intelligent machines; work with them”. Today, some of the world’s most powerful players are cyborgs – combinations of human players and machine intelligence – and they are damned difficult to beat.
I believe in this human + machine philosophy. Over the next year or so, I’m going to be launching more little experiments along this line. Let’s see where it gets us.
A friend of mine, a mathematician, recently posited to me that the role of poetry was to capture intricate emotion; I countered by saying the role of poetry was to convey information through the ordering of words as much as the words themselves. But in both our arguments there was the implicit understanding that there was a poet, some creator with a sense of purpose, be it information or emotion. I wonder if we can have the same argument about this, or whether we have to start that argument again with several biases removed.
https://www.gwern.net/RNN-metadata#finetuning-the-gpt-2-small-transformer-for-english-poetry-generation
https://openai.com/blog/better-language-models/
https://www.chess.com/article/view/deep-blue-kasparov-chess
https://www.ted.com/talks/garry_kasparov_don_t_fear_intelligent_machines_work_with_them?language=en
http://www.bbc.com/future/story/20151201-the-cyborg-chess-players-that-cant-be-beaten
*edit note: this article formerly read “GPT 2 110M” instead of “117M”. Fixed.