ChatGPT plug-ins are an amazing way to learn more about prompt engineering. Specifically, each plugin exposes a field called description_for_model.
My sense is that the description_for_model corresponds to system messages that we include with API calls to the model. Both set up the context to guide the chat session.
For this analysis, I looked at the extant 127 ChatGPT plugins. To quickly get the manifests of all 127 plugins, I followed the steps listed in this reddit post.
For Firefox (Chrome should be mostly the same):
This gave me a 274kb JSON object containing an array of plugins with the following keys
[
'id',
'domain',
'namespace',
'status',
'manifest',
'oauth_client_id',
'user_settings',
'categories'
]
Of these keys, the most interesting key for analysing prompts sits inside manifest > description_for_model.
I first extracted the description_for_model of all 127 plugins yielding an array of 127 strings. Then, I got the length of each string to get an array of 127 numbers.
I then asked ChatGPT4 to suggest some statistical analysis I could do on this array. It came back with the following five high level options.
Luckily, since ChatGPT comes with the Wolfram Alpha plugin, I was able to throw this back to ChatGPT after enabling the Wolfram plugin.
In response, ChatGPT sent my array of 127 numbers to Wolfram and within seconds, came back with the following insights.
The frequency of each number in the list is mostly 1, except for the numbers 60, 192, 98, 155, 104 which appear twice. Not very useful information, imho.
25th percentile (Q1): 117
50th percentile (Q2/Median): 280
75th percentile (Q3): 984
See the massive jump from 280 to 984?
This means that about half the plugin developers are not taking their description_for_model string seriously. In other words, they are half-assing their only existing tool for AI Engine Optimization.
The outliers in the data are: 5325, 2352, 2652, 6924, 7881, 4707, 4211, 2477, 3797, 3603, 5381, 3015.
These numbers are significantly different from the rest of the data. Not very useful information, imho.
I'll skip visualizations for now because I don't know how to insert images into IH posts
There are quite a few conclusions to be drawn from this exercise.