GPT-3 and SEO: Overview

If you’re reading this article, you’re probably already pretty familiar with the basic idea of GPT-3 and likely have played around with it. The focus of this article is the impact we foresee of GPT-3 and SEO as converging trends.

I’m not going to spend too much time on the mechanics of how GPT-3 and ChatGPT work here, but if this is an area of interest for you, I’ll share a few links that can help you get more familiar with it: 

Most people, of course, probably have used the ChatGPT interface more than the API when it comes to playing around with its capabilities. What you’ll immediately bump into when using the ChatGPT interface is the fact that it’s not connected to the external internet beyond being able to serve answers to your questions and, frankly, that’s probably a very good thing at this juncture. The question on people’s minds in our industry is, how is all this going to change SEO and search as a whole?

Some powerful SEO examples

We can begin to explore this by looking at the way that it provides solutions for tasks commonly related to SEO. 

First, let’s ask ChatGPT what it thinks about its role in SEO

Fsylrfq2Cxmtyuoastbgh1E2Piv1Nhlqjenbzsr4Qnvs2Wsremmigogkvttgh53G81Rj3Nh7Yjwqvgppkpm5Eatotfxfs6Un7Uznphkaolhzxyctxkvwzmf8Wpdowcqczwgwh Plglvwaz5Uu19Or06Luovydxtqzu7Oukaipdhcfuqny 6Rz Y6Fhbbq

Jezfrh3Otgtcm3Wnbcfgfunejxf Nlvfadvocdapm9Zrqaxpgeg2D8Zpyg13Xhkehxszurrv8Uh5Yknhroxql4Yb4K7Nkcyerps0Kwq8Xsrofsklsltjjliq6Y6Svijfpqdnnyheiv9Amx0M5Nvd9 Tuwaaber9Xsu98Uheyjq7Qdiihwdftdyvajaho0W

Pretty hard to disagree with that, right? 

We’ll talk more about this but this is a fascinating response so far. 

Next, let’s show off some of the areas where GPT-3 and especially the Chat interface can help to quickly accomplish certain types of tasks for SEO.

We’re going to start by generating some keyword suggestions so we can feed them to GPT-3 and examine the results. 

We’ll go with the topic of “running shoes”:

Yqcsunnuj 4T6Vmf1Xey9Xc2Qtwbyd1Y3Xjoljkpf9Cy6Sx2Nbfrcixmabetrc61Qkhgwvpwwkjjkw4U4Lpxg9Kiehps 89Tsx6Frww8Fggyq5Svs29Voprjpbh2Mtq5T4Kai9Ze0D1D77Fiiwzo463Rbfzgb02Fi0 Dem8Lhsza75Sc83Irmvvq0Iq2Va

We get roughly 3,000 keyword suggestions from this general topic. Let’s download them and see if ChatGPT will take all of them. 

Stbu8Zjmuluxdk2Abzprx7Jkmm7Thfrylo5F3Fmo4Tyug4Fpvmxsjv7B2Satsjjhlq0J4 Ujuxnm3Tm0Lu6Lawynr6N8 Oyd1Vcblhhuwshv8K5Qr2Pjxhkjcvqqjdsam5Vzrbcobeyku76Gzy4F2Ut9Hrexm4A194Qhlwvfmfoiumo3Jygybj8Al2Poqa

Nope. Let’s try with 250 keywords.

Iv5M2Hxzsffsrehrzyqgb8 Jheikfraf2Pechvzuvbbinn9 I1E6Ioyy8I2O7A9A4Dss5Job6Yx0Tlxlikqxmf9Vvxi Junbhcq1Z5Uuop Pyp0Nmjr5N4Wbe0Wpytit6Kva5Slxncveyxgvd5Wzixrtie J9I54V 9Xupcgrt9Irk X4Rjkj41Azg8Za

Ok, let’s get to work. 

Keyword grouping is one of the hardest and yet most overlooked activities when planning SEO projects, simply because there are so many facets to the data of even a single keyword. 

It comes down to which facet matters to the users in the business and, if you have multiple users, then you’ll get an equal number of opinions. In larger organizations, there are entire teams dedicated to building knowledge taxonomies but SEO teams typically don’t have this luxury and are forced to do a lot of work with a small number of team members. 

Can** keyword grouping** be made easier?

For this reason, we have some automation tools in our platform to help with this and are always adding more. Based on this experience, I was curious to see how relevant the groupings that GPT-3 would generate based on some minimal instructions. 

Ymt3S3Tjvrt 3Pp3Ykmzn7F8Utjdawk0Eyi3Vjqeab73Tuchxbyteyjvellsvzbaiof4Zk9Qmhi8He5Z9Ptzxwsaamucrv1Rxautd8 Pnzthv5Q Yt9871Ttdpxlgndz3A1Of Zo8E6Pcdnicvxim8Lushfpk8Szbuemrnjb7Dpyde A Ao2Wbx1X Moa

And ChatGPT stopped there. This is quite typical where the service will stop far short of what you requested, obviously to save computing resources, which is quite understandable for something that is freely available at the moment. 

So I prompted a little further and got this: 

Xwozoefdvta O5Okaox97Sgmzsdouvq2Gvmqsmixej31O1Butg Pqy1Gpz8Hp5Nkot5Lmrbl1Twb6Xnat Oho8A00Ck5Acoxjw7 Z6V1Rykoiiiahqloxquqtugoasjd9Su5Tel2 F Dybkewdevekd6Zz3 F0P1Tppwce3Xy6D8Wxpjgl6Wddtffpchpa

And it bombed out again due to server capacity. That being said, these reasonable attempts at grouping the keywords I provided with only the bare minimum of instructions. 

Next, let’s see how it does with generating some Python code to do the same thing. 

Code generation for automated keyword grouping

First, let’s trim down the size of the list to 25 keywords because GPT-3 kept timing out. 

Then I told it do write a Python script that groups the list of keywords based on topical relevancy: 

V9Y0Pijzykm485Asodvw4R4Kao Hwhyiisbob Z7Sjguhjkej6Npsmnxlo Eqko1L5Qkoklto6Qcem 6Ipntql3Iycqbjqz1Eenam N6Zvvmrnj6Rzwfiecfi0Kkscyiei4Fkil5E6Hnybhk93Ucc4Rm0D05E0Mefa5At9Rsn1N94Wnmz7C0Gcbhsuvk8A

Rwtcraohzqqnx7Xyyfnndqyzrhoitaro0T7Q7Ey7Bzibhmd1J0 Frbu9H 0Zhy88Mx5J 50 Pbe4J Zmbwq3B5B Fyqzion8Onhjeruts33C92Omkhq94Jj6Cjbxocbiegzfrcdq Zqy5Aoccr8Yovzhsm38G 4Mxrew9T06Fqods1Pv1Eltthxo2Kktq

Ok, so it did write a Python script to group these keywords, but any developer will immediately notice that it is a very simplistic approach, based on creating a small dictionary or library of potential groups based on word mappings. 

This approach in and of itself is not bad at all because it is easy to extend and can be easily explained to anyone on the team of stakeholders. 

Lemjax 1Mtmqv3C5L9Yeh2 Yzauwkccma31Aw6Zvg3Djdcek6Aem Hy Ptt4 F74Xlrqaioevaoo1Luwz8Zxocgvkdleueka Ozjfam5Fwrpqgapjwgnyxeccu93Dexxr73Ekfkhqmtayru1Z6B5Erjlmcihjr7Yrjvmzkszljkekxoplwegouagytwcww

But, I wanted to see if it could do something a little more dynamic: 

8Qyjnjcfw4Ffrh95Mya Qkw9Ywt1Jux38N9Vc1Npbkwq8Orqnbvsor Qjxnctlvmrxxuz6Mhkouty9Yjneslldpmy3Jre9Asdqyzpzkbwwtdnfzj4Lxnq 2Uu69Ktjcbasujrsp4Vexusbwlsgjqsf1Oofqwyshit5Ozmxzcd96Jlijyxsjbu6G Caw2G

I had to cajole it a bit to get it to write the code: 

Ce4Idf Yxocbzrlzwtars86Bkg2Rdombzltv Bxhhcrioz2Lqe4Mmlbuiuxudne Shinkbjoqavi363C4Uiy6Fddqp7Ga3Bgztdvygi7L9Kieh7Uxvcv7W5 Ffoaxto X2Cdrgwzs199Khqaiecvxyeh7Cue Az9Yfvlosffqe4Kwnzd3Qvtbvoxqmclrw

And finally, here we are: 

R9Dl6Anqtvn8Rkteu W Jqgvobttgf Dq Hnp8W Uiq4Ilt0Ygqbo7Ysap0Hqegsy Fbyo3Cq1Tkngh2Bxfxfb5Cnnznccioay5Exqdfdcg35Uwwffdiq4D6X Z59Wmchp1Gbqayrlyqkkwlqfzl2 Nfifvvi5Xwjlwa7Dy6Leol9Jyg729Xogecbcxhqq

X3E7Zqasz9Rat6Hfvamzf3Ebge39Xt1Mq69Jvexvxup9U9Rt0Jrmu0 Kxvucfuzhxghtw0Eb4Xkmmd2Iq0Roxe3Cfyf7Vgursra1Awf4Xgb0Uz8Ykg4Mv 1Zax5Bqpgdyozzewbczoyzobj3Z2As8 Dedlrucbgesoxfwksohsyyzqxoac2Istkjuv5Buw

Ok, fair enough. There is a lot missing here but ChatGPT itself pointed out that there were many more dependencies and preparatory work needed to build a working version of a good pipeline for this process but it does give a rough overview of how this could work. 

This is well-trod ground and there are many great examples out there for people who want to do this themselves. 

How to make schema generation work for SEOs with GPT-3

Next, I wanted to see how well it would generate JSON-LD structured data, since we’re talking about items that you’d typically see in an e-commerce website. Being able to automate this process based on data in your catalog is one of the stronger use cases for something like GPT-3 in automating content and / or markup for SEO. 

It also doesn’t, in my view, really violate the “organic content” principle that I will discuss further below. 

E Vtc94Dlbvqdpwkayzdpi4Tnqp3Mpqhn2Xlyqoypmabtu1Djsfjbhecy1Hhm4K Nxj0Biu4Fkfap1Pxwcohxukwb0Xbvaocf7Yywgxd5Lm Qjqfvrm04V4Nhrnrbcvrfzyhdw2Oghvhcydck2Rueoe2M1Ciol6Aabrgcs1746Vbbjnhaa6Xye4C05Qaha

Wetqco0Wa9Xeqn5Xttgn6Kepl6Jxoeobhlexqcuc2Zemyti1Tcth9Egdhrzuekk5Zv2M Iuc Gbangifqnfaa12 Tt5Dhqpe Y Sueqravl37Ddw1Cx Veuke Jm9 D7Fvkdgw6Dsi Odhkec Nylr6Dru9Enifrgsuaffg6Njy3Yvpjxqdno9Asgd2Qiq

Great. Obviously it can be expanded but it’s a nice way to prototype some quick structures. 

Let’s expand it out further to encompass a graph or ontology of categories and products for category listing pages in an e-commerce context. 

Cqq2Qva9 4Pizu8Svdptlybe7Tc4Icklgd6 Kuarjhephln5I5Xewwibd N1 1Parlqdrou0M3Msw2T N5 Jcdodirbdgtdb Jbykzdxvrgxctrghuxmnqomjqvpcazqcv4Xx6Ubm1Te2Ouyzcrney6S Kounxdt13R0Cpb6 Ovmvtxvmokjlgbwktlvg

We can run this through the JSON-LD playground and see that we get valid results. 

Ntbsrw4Ipx3Xrdaw2Fcpvladx1Dfjfnavuowyojlxm7Feulo5Qefbkjnwyyr Bu0Msdlyqldvxfga2E1Znmq11Vkmnj Unaqkchus47Ibiyo5Jeyq Oeb 8Vslkve03Uvumtx7Dp5Bqfo1Qjyql7Epru6Zh3Iogiuclqifcihwsdrwwzkopsuwe6R3Kphg

Of course, you may want to tune this according to the structure of your site but this is a very impressive output based on such high-level requirements. 

Search** intent mapping**

Here is another one that is a very helpful filter when looking at keywords, which is the concept of search intent. 

I was curious to see how much ChatGPT knew about this topic: 

Lum71Nbz6Kowrmiwdkpwzwaqpqf4Lnuu 4Oondykucxwlzmpcl2Oeysuu4Dnzwwdecf Aud9Uggttq6T 1Zn78Kbrc2Tqnm5Rahl8O7Kouaztm2Py9Gtqsu7Un274Crieqehdpwk64Jgy3Prwzrcxpvxtnufm28M8Mdcspflvjhvd Taapffbgtzehsiha

Solid. Let’s see what it comes up with for my list of sample keywords: 

Oqxhjabw7Vd29Qdwkt8Ql4Z Xqtqyw7Hq6Wbgcchnlnk5Imm0Duiwd04Z8U4Jjtkb2Ef Lz2Aotfv5Aondjkyo19Fq3Uhbnegbarocia F Imqkz4Oii Tiqwbggcurtjetepr0Epdbssjbrdeag9 3Bzwy5Luku3Ph2Kwfnerejxrr6Pixef6Tkzp4Kyw

Pretty good, especially since these are all product names that do indeed tend toward commercial / transactional behavior. 

Build keyword lists

Next, I wanted to see how it did building lists of keywords. This is another area of SEO, which anyone in the industry will immediately tell you, is one of the hardest and most time-consuming aspects of the business. It’s not that it is hard to get lists of keywords, that part is easy. The hard part is getting ideas for keywords that are relevant, drive traffic, and also are not impossible to rank well for. 

I didn’t expect GPT to be able to meet all of those requirements, particularly because it is made very clear to you that it’s not connected to any external data sources. As a result, there is no way to to measure relevance (aside from the linguistic angle) or know anything about search volumes or ranking difficulties.  

Ft Hawahq1Nmge8Yhiktfdzmqwkbdx6Csmm7Qmap9Doqeo9P1No1K3Ibzknmpb Nb0Of Ramxqxgp Otg4Roqvde2Kqc5Gwxs1Pdom5Tiyq23Fhmbfyuxq5Lkv8Ovgmnv Sps8Tix5Wdbrgqyo75Rkcdc8Dfikevgcbtritulawjjwxstnkaksw2Ylmx W

Ok. Well, they are keywords. 

One nice thing is the ease with which things can be translated to other languages, in the same context. 

Translate them to Japanese:

9Dwtdp02T 0Knxrvnhiedmtjqtaj4Fq66A96Kxidg63A4822Xeum1Mxtbs2 2Nua2Wimexhqnjcjfm7Lhai119Lzfgdw79Bpjpsjj5R3Uai0Xnpa2 Jv Swnwty 1Rtck1Tupcyqrppx6Agevfyyvt4Hfcwbymkqpeafkfwah Rsufzbd5Xdn2Rqpveoba

There you go. The same keywords, quickly translated to Japanese. 

Do note, however, that there is a HUGE difference between translation and localization, especially when it comes to SEO. 

I don’t see good solutions yet coming from any of these AI tools to solve the localization problem with a high degree of cultural relevance but the translation part does at least get you on the path. 

Extract keywords with GPT-3

Another notoriously hard problem both in SEO automation and NLP in general is keyword or term extraction. 

Again, it’s easy to get words from text. The question is always, how relevant are the words that are extracted. 

1Z1 Rntdiqet5K0Tmzlfxo37Pxnly7Mevtvj5Zsoizuxpej9140Xsyf7730D41Vdce3Vy3Seactlszr87Chsquoa5Mn9G1Vlk5 Iegf Fke Bgtz0Xw2Yeilofqaqt Bdjqfyaocwndrhp7N0L8Wqpcfmxkktepj9Qtxsz1 Ppvpe035G8Uwai5Z4Ixw

Dc8H5Pnx9C3Rixf8Ukhe40X Ppqxdkgta8Bswhtsarcyxfu7Olnkf 4Pkbncaegrma7Zo2Nn88Ib Ui24Esz6Vdsunrjwjlsyh8Qbzcww7Qducv3Qb8Zi 21Jyuo95Ei239Z6U3T5Uh0Dkcqgavgsqgc7 Itkaiz Gmbyooigxjuhj0Faebwi8X9Bjvopw

C4Mvyeiquoo8J0Asd30 0Pm029 3Frhgiynciufudrpafgb19Ii5Vs Ydgc62D6Lg7Vksvl8To Rbaludn5Fubrknxj1Xamu Avytwfazy1Xl204Kocdl4N1Zamu5S9Kfpwmi4Fndv4Ldkmxfapydrbxsahoazfvjzcosoegqcyt71Kgyozafidaqyrjsg

These are quite solid results.

Writing a content outline with ChatGPT

Now, we’re starting to move into more controversial topics: content generation. 

First, let’s start with what I believe is a still relatively benign activity, with some serious caveats, and that is generating outlines for new content. 

The main caveat here is that, if you simply use a tool like this to generate an outline and hand it off to some outsourced content writer, you’re probably going to get pretty vanilla content, at best. You might even get AI-generated content back! (more on that later)

Let’s see how it does: 

Kbfvyairdbdu4Traf9Bmbogxyhwl22Vtppjdqx9Isguoqkwqykvf4Fomhu1E Do Fikrfgj Wkbo2G1Gkma3Bokbhlnyxoobtbc8Hrsm9Mb2Qxnhlc7Q Ofrqs08P1Jnmrdsppdvztznst4P9Wpw7Vatfsrfc6Cltwr9S99Ykrpvi Nkghf9Gahdbpmkg

Pretty boring but it is an outline and might provide some basic writing prompts. You can always make more specific requests. 

Writing a blog post based on an outline

Next, let’s use the generated outline as a prompt for writing an actual blog post. This is where people are going to get in trouble in the coming months and years. 

Epf1D Gjngunlrbq2I8Asellrq5V4Q4Zarf2O5Okeq7Bifiekvaftfft 4Aa0Ubf415Wtnecqyi9Vspwuczguxhor3Ec9E5Sxjbham2Mmdnoowwgdkezrsddela Kwlke3Ubypiqovhy64Jrtuiikpwvcbk11Id8Hzmp7Cjsh2C Jko6Gecfxqo Ai Aeq

It kept stopping after a few sections, clearly not wanting to spend the CPU time on creating the whole thing. 

But you get the basic idea.

The big one: writing a blog post from scratch

Here’s what many would consider to the be the Holy Grail when it comes to content generation (I’m not a fan, I’ll be honest), but let’s see what we get:  

M934Y3Zbbzuytb8Zqfijdbmzfgmzbkr G0Syq7Uaivzxvm47Jm03Jup3Vcuehewbw2Qdulkfzbykiugjh22Cq Bgmielmu56Iwxjfvqp7Rt1Nodohjsiqeu7Ujedeyog5 Kxqnwehsxyglrc1Whclnwy Ew35F7Ajts3Hzmajixqdvuxo2S01Jb1Ic22Iw

677 words, not too bad. 

It’s an ok start for an article but you can see how the web can very quickly and easily be polluted by what is ultimately low-tier filler content designed to find its way into Google’s indexes. 

For those who care about energy consumption on a global scale, one could make the argument that AI-generated content will contribute to an incredible increase in energy consumption simply to enable the search engines to identify generated vs. created content. 

Let’s look at another task: 

Generating titles and meta descriptions based on keywords, emphasizing the keyword in both fields

Coming up with good titles and meta descriptions for your content is another task that can be time consuming. I figured ChatGPT would do a decent job of this due to the shorter length of the text required:

Uo0Snz Ja1Dfgczf9Zgzmceejltynozc3Tmqnktflsfpo0Zarz3Nyddkuzahkiy J7Lmargqxuklw1B Ydcdpwbyccqdqx0Gobi Mn5Vvs8Qzq6L8Eocwfuobk1Hdijsong72Elj5 Fnjsy2Bp2Npmdurmylxmbodif6Osydvvuvnivcl81Bkbyficvxlg

Could be worse. 

And for meta description: 

Xore3Mulklxyfhdozripceck0Odpqwvpjrznnfxvsxr0O9Xdxlvci3Qmfuhcrvvg62Oq2Bjo Utyh5Ao Vmduzg8Ynewxka2Aoelhlrdcxronbstn4Mxgcamkcwsm Rpcnjqtjhom6Rlz3Rbb68Fn0Ee By6Zvuwlue6Oforloky4F Hk9Lfbu03Lg3Nqq

Again, not bad for a robot. 

At a minimum, it is a way to eliminate duplicate and empty title and meta description tags. 

Of course, Google does sometimes rewrite both of these fields in the SERP already and this is likely to increase based on usage and other metrics. 

That said, this could still be a pretty solid use case for at least fleshing out a bunch of titles and meta descriptions (we’ll discuss the moral / ethical / creative questions below) pretty quickly. 

This would get even more interesting in e-commerce use cases, especially using the API, when merged with data from the catalog:

Ivli0461E8A63Qmujwc37Idn6724Zsmn4T28Gdganbhesyvcry7Jt2Gqjtw3Aslnexfy Pquyrxrm3Fe3Gzrra3Jswtx562Qvaiupdmovauwtw Kxlddqnieagvtfgvqcz2Kjgafum Em1Qujaoaqsm0Wkwe7Zrsx2Qgqjqyqixkgf9V5Ntyzyb4Wz Sww

Much more interesting. 

It still requires some tuning, but once you get into automatically generating text, tuning is what always requires the most time and energy. 

It can require so much time and energy that you have to be aware of your point of diminishing returns relative to the perceived time and cost savings that you were initially banking on. 

Closed-loop thinking vs. Open-loop thinking 

Now is a good time to wax philosophical and start coming up with some concepts to evaluate the ethical, moral, and metaphysical questions that inevitably arise in these discussions. 

Let’s start off with a real high falutin’ question and see what ChatGPT says: 

Nm73M395Ifcomte6Tkht0Bmhhuqlv0Yj39Vkryub6Owcbsnbhswutnpygixqko6Fdcaisjpxsk2S0Ee0Iks9Jke Wkjvp2Fwvs7Tiuh4I672S73Gsjp1H2Zylrstzlm3Hm464Ssjahj4Qrjivnk1Bh Vkzam Pb0Zf8Uayc2Go254Udwijatpm78Tjmcwa

We’re off to a good start on the topic. 

Let’s ask it about one of the central debates in AI:

9Riqol4Oias89Up3Jwhc8Ht0Zjxoi P7Wwim4Ttn0Lgbbh6Spkrnkw25Aqgsmad8E1V2Coznvghwenacoooulvydjmhg0Xm5Klr9Qwdrcfhko9N7Qpqlewxs9Ax1Zkgq66Udnwcyrvpftidyjbbgdvfixdc6Ihordzyrsyec6Ce9Ny9Rz5Laz Hgwzcyjw

Pretty good answer. 

The next two questions were a little out there but the mood was upon me so I went with it. 

Ss2Cpvn0Qi8Mj5Pygtqm8Vjresvcvnavjm98Vh6Wlixtk58Dwhlj5M6 Nabdib1Tdwwvftrm3E3Rrl0Kkq3Kozvdbqleedz6Jqxzkugrcbhc91Urvgrinkou424Ydhxoiwbbioenz9Vozzhnjcfylzx5Eygyfzoi6Algb5Lttkjww0Pgd Plfkjkq2Ifdw

And, finally:

2Sk4Jp4Qgwtyqurvh4Pmdjoms 62L7T Rzccxaumjjabonw2S62Kseryumsdrktvwddu Cqlcz0Ii169Ls0Tp9Bibiajamusghkv4Uqlae9Sm6Qefu83Dzlhet8Hojlxlxyp7Wpvnkagborbctce Abp2Ecpss7Dhrstikuny Ctnxyok9Wwdq5N9Z Gxa

Some further clarifications and questions: 

Y1Drqjx57Zsqm2Lkmzna Vn2Uwcjaw6Biszrdepsqyrmo3Tjh9Hhtgu5 Epbxj5B1 Guwfcp5Uspftmh4Lithc8Lrj 8Ockihhizizrz30N8Pwnfblc Scoxhk5Ilera1Ffr Bpyir7Bz K9Kwtprmcerbzu2Hwvhuuyy6Byaphaociyvuxgy1Wwc9Wwxq

So, that turned out to be a pretty interesting discussion. The main point and takeaway is the idea that GPT and similar models are ultimately gleaning content that was created by humans. 

This creates a closed-loop of available thoughts and words that it can choose from. 

The societal risk from having a web polluted with massive amounts of AI-generated content is that a great filtering occurs from the natural thoughts that flow out of the experience of humanity. 

We’re already dealing with the effects of massively reduced literary skills and vocabularies of modern writers (on the web at least), due in no small part of the “mercantilization” of writing that has already occurred (ironically or otherwise) because of the preference by search engines for easily parsable articles and, in particular, “listicles” and other such tragedies.

A novelty-seeking mechanism in search algorithms?

A long-standing theory that I’ve had is that search engines do function in some similar ways, particularly with the advent of deep learning, to the learning mechanism of the human brain in that they seek “novelty” and that this novelty-seeking mechanism is what ultimately (ideally, not always in reality) gets expressed in the results of the ranking algorithms that produce their results. 

If this theory is true, then this could be one of the major failing points of AI-generated content. You simply don’t get novel results. 

Remixes, yes. Novelty, no. 

The dangers of misinformation and potential compliance issues

I’m not going to post them here today, but I’ve seen some cases where ChatGPT very confidently provides answers that are outright, simply, and factually incorrect. 

The danger of misinformation leaking into content via GPT is quite high because it sounds so confident. 

This can create all sorts of hazards for online retailers, from product liability to cost and inventory issues. 

In other industries, such as anything related to finance, insurance, or healthcare, using auto-generated content is probably a very bad idea no matter how you look at it. 

The ethics of generated vs. created content

Even before there were any working systems in the public for automatically generated content, the word “content generation” became quite prevalent in the digital marketing world. This term always felt dirty because, at the time anyway, we were talking about humans writing and creating content, not “generating” it. 

I put it in the same general category as referring to employees as “resources.” 

They are people, not resources.  

As a result, I’ve always corrected this term to content creation and now I find myself using it to further differentiate human created content vs. machine generated content. 

If we’re talking about AI, ok, you can say “generated.” If a human wrote it, say “created.” 

Where things begin to merge, is when you move from talking about a single article or other piece of content, where we have this “created” vs. “generated” dichotomy, into the production process. If you’re going to scale to the level required to compete in hot markets, you’re going to need to build a creation production operation. But this is still most commonly done (as it should be, IMO) with created content, but making smart use of tools along the way to optimize the workflow. That is a topic for another time.

100% organic free-range content

This matters also from a technical and a business point of view. 

At this stage, it is hard to predict exactly what Google and the other search engines will do and how they will evaluate the question of content that is generated vs. created. 

My personal prediction, however, is that within a few years, aggressive moves by Google to identify and penalize generated content in its rankings. 

This assumes, of course, that search interfaces will even look the same 3-5 years from now. 

There is certainly no guarantee of that as anyone who has used ChatGPT immediately noticed that it provides entirely new ways of interacting with information.  

At the same time, we can also expect Google to come out with new AI products of its own and it could very well be capable of annihilating both ChatGPT and GPT-3 in future iterations. 

We won’t know what that will look like until we see it. 

There was a very good article published in the last week or so, on this topic of truly organic content, called “Certified, 100% AI-Free Organic Content.”

It’s definitely worth the read. 

The other envisions a world in which both humans and search engines want to filter for content created by humans. He proposes the idea of a meta tag or some other sort of HTML tag to identify AI vs. created content. 

I like the flexibility of this and I think there are even more opportunities to do something similar at the site level. 

A thought experiment for self-certification of 100% organic content - organic.txt and organic.xml / .json for SEOs and site owners

During a discussion of the above-linked article on Hacker News, I mentioned that the idea of a site-wide file could be an interesting way for site-owners to self-certify that some or all of their content was truly organic. 

We already have standards such as robots.txt and XML sitemaps to provide indicators to the search engines how the site is structured and what can be expected during crawls. There is also security.txt and others. 

It’s easy to imagine a similar file called organic.txt that is structured similarly to a robots.txt file with directives that indicate which, or if all, of the site is completely organic. An extension to sitemap.xml could also add an attribute doing the same thing and, of course, this could also be achieved through meta directives at the file level. 

How likely this is to happen will ultimately depend on how much the search engines, particularly Google, profess to care. It would save them a ton of compute time and money, however, if a reliable market for these identifiers were established. Sites that violated the trust of these declarations could be harshly penalized in the rankings and the web would, frankly, probably be  a better place as a result. 

You can use GPT-3 as an advisor instead of a writer to get good coverage

All of that said, I still see a lot of value for automating SEO tasks in the sorts of language models that GPT-3 represent, as well as the genius of the ChatGPT interface. It’s an easy demonstration of their power.

Based on my own usage and some of the experiments above, I think that ChatGPT can easily fill the role of prototyping tool, data explorer, and advisor on some basic concepts. It’s almost more like a dynamic Wikipedia than it is a search engine, at this stage anyway. 

Going forward

To round out what has become a very long blog post, I came away with a number of key takeaways. 

In general, I don’t think ChatGPT as an interface is ready or even designed to handle large datasets. This can certainly change, of course, as the platform evolves and as paid plans are introduced. 

The Chat interface is genius on many levels, as demonstrated above. 

This will ultimately lead to the further commodification of AI and I already roll my eyes most of the time when I see other vendors and SaaS companies in general talk about their AI and ML capabilities. The technology has become so pervasive that it’s almost redundant to mention it. This trend will only continue. 

Some of the task automation above demonstrates that ChatGPT can be very useful for task acceleration and ideation but certainly not a great fit for the finished product in a range of professional settings. 

I think there is a chance that it actually ends up having a larger impact on the Business Intelligence (BI) and Knowledge Management (KM) industries than it does search, but we will see. The possibilities for new forms of advertising are, to my chagrin, limitless. 

How about GPT-3 itself for use within SEO? There are a lot of opportunities for automation and use in classification, extraction, and ideation. For content generation? I certainly wouldn’t recommend it.

In the SEO world, SEOs and executives need to be very disciplined when evaluating this technology. It looks great but it can potentially open your company up to liabilities that weren’t a big risk before. There will be a need to monitor and evaluate new content as it is being introduced to determine whether it was created or generated.