{"id":14914,"date":"2023-03-21T17:54:18","date_gmt":"2023-03-21T15:54:18","guid":{"rendered":"../../../../@p=14914"},"modified":"2023-03-21T17:54:19","modified_gmt":"2023-03-21T15:54:19","slug":"how-chatgpt-works","status":"publish","type":"post","link":"../../../../how-chatgpt-works/default.htm","title":{"rendered":"Looking Under the Hood\u00a0of ChatGPT – Its History and How it Works"},"content":{"rendered":"

Released to the public at the end of 2022, OpenAI's ChatGPT has quickly become the fastest-growing internet application ever and a disruptive mainstream phenomenon. The introduction of ChatGPT and its successors marks the beginning of a new era in our relationship with technology.

\"artificial<\/p>\n

But as we are becoming increasingly reliant on artificial intelligence and machine learning in almost every facet of our lives, we need to be more aware of the inner workings of these technologies.

Looking just a little below the surface can help us be more intentional with the use of AI, assess its outputs more critically, and fully tap into the available machine learning expertise.

In this article, we will delve into the history of ChatGPT and explain how the tool works.

It all started 30 years ago

Machine learning researchers and the scientific community in general have been captivated by the idea of artificial text generation for decades.

The core technology behind ChatGPT has its roots in recurrent neural networks (RNNs), which can be traced back to the 1980s. A neural network is a machine learning model loosely inspired by the structure of the human brain.

The word ‘recurrent’ in an RNN refers to its ability to store memory and refer back to past data, which allows for a deeper understanding of context and opens up predictive capabilities.

RNNs are widely used in products that involve natural language processing, including Siri and Google's voice search.
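To make the notion of recurrence more concrete, here is a minimal sketch of a single recurrent cell in Python with NumPy. The dimensions and random weights are toy assumptions chosen purely for illustration; a real RNN learns its weights from data.

```python
import numpy as np

# Toy dimensions and random weights, purely for illustration.
rng = np.random.default_rng(0)
hidden_size, embed_size = 4, 3

W_xh = rng.normal(size=(hidden_size, embed_size))   # input-to-hidden weights
W_hh = rng.normal(size=(hidden_size, hidden_size))  # hidden-to-hidden: the "recurrent" part
b_h = np.zeros(hidden_size)

def rnn_step(x_t, h_prev):
    """One time step: mix the current word vector with the memory of everything seen so far."""
    return np.tanh(W_xh @ x_t + W_hh @ h_prev + b_h)

# Process a five-word "sentence" one word at a time.
sentence = rng.normal(size=(5, embed_size))  # stand-in word embeddings
h = np.zeros(hidden_size)
for word_vector in sentence:
    h = rnn_step(word_vector, h)  # h now summarizes the whole prefix read so far

print(h)  # the final hidden state carries the context of the entire sentence
```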

In 1997, the invention of a new type of RNN called the Long Short-Term Memory (LSTM) network marked a major step toward advancing text generation.

The ability to retain information over longer stretches of text further deepened the algorithm's understanding of words and how they relate to each other.

So, researchers worked out how to make machines generate text nearly 30 years ago, but only recently have chatbots become as capable as they are today.

Transformers as the main enabler

The real breakthrough for natural language processing happened in 2017, when the Google Brain team developed a new type of deep learning model dubbed simply the transformer.

It's hard to adequately explain the major advantages of transformers without diving into the intricacies of machine learning, but, generally speaking, transformers can process input data all at once instead of sequentially.

In other words, RNNs make sense of text word by word, while transformers analyze the whole text as a single set of data. Transformers can also take into account the position of a word in a sentence, enabling them to decipher the meaning of words more precisely.
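To illustrate what processing the input all at once means in practice, below is a minimal NumPy sketch of scaled dot-product self-attention, the core operation inside a transformer. The dimensions, random weights, and crude positional signal are assumptions made for illustration; production models use learned projections and sinusoidal or learned positional encodings.

```python
import numpy as np

rng = np.random.default_rng(0)
seq_len, d_model = 5, 8                               # 5 tokens, 8-dimensional embeddings

X = rng.normal(size=(seq_len, d_model))               # stand-in token embeddings
X = X + np.arange(seq_len)[:, None] / seq_len         # crude positional signal (illustrative assumption)

# Random projection matrices standing in for learned query/key/value weights.
W_q, W_k, W_v = (rng.normal(size=(d_model, d_model)) for _ in range(3))
Q, K, V = X @ W_q, X @ W_k, X @ W_v

scores = Q @ K.T / np.sqrt(d_model)                   # every token attends to every token in one shot
weights = np.exp(scores) / np.exp(scores).sum(axis=-1, keepdims=True)  # softmax over positions
output = weights @ V                                  # context-aware representation for each token

print(output.shape)  # (5, 8): all positions are computed in parallel, no step-by-step loop
```

Note that there is no loop over time steps here: the attention weights relate every position to every other position in a single matrix operation, which is also what makes transformers so much more parallelizable than RNNs.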

Importantly, compared to RNNs, transformer-based models can be trained significantly faster, allowing for much larger training datasets.

The GPT Era

The first version of the Generative Pre-trained Transformer (GPT-1) was released in 2018 by OpenAI. By augmenting transformers with unsupervised learning, OpenAI managed to create what we now call a large language model.

This time, instead of training the model on annotated data, OpenAI let the model detect patterns on its own. Given that data annotation is a very laborious and time-consuming process, this allowed the company to drastically enlarge the training dataset.
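A small illustration of why no annotation is needed: in language-model pre-training, every position in a raw sentence already comes with its own label, namely the next token. The whitespace tokenization below is a deliberate simplification; GPT-style models use learned subword tokenizers.

```python
# Turn raw, unannotated text into (context, next-token) training pairs.
raw_text = "transformers process input data all at once instead of sequentially"
tokens = raw_text.split()  # naive whitespace tokenization, for illustration only

training_pairs = [
    (tokens[:i], tokens[i])  # (context so far, token the model must learn to predict)
    for i in range(1, len(tokens))
]

for context, target in training_pairs[:3]:
    print(context, "->", target)
# ['transformers'] -> process
# ['transformers', 'process'] -> input
# ['transformers', 'process', 'input'] -> data
```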

GPT-2 was released just a few months after the first version, and the third iteration followed in 2020. From this point on, it was all about enlarging datasets and increasing the number of model parameters.

Note that these improvements were rapid rather than gradual. To give an idea of the scale, GPT-2 has 1.5 billion parameters that engineers could adjust during training, while GPT-3 has 175 billion.

However, because GPT-3 was trained on the largest dataset ever using unsupervised learning, the model mirrored all the good and bad that the internet has to offer.

GPT-3 showed pronounced bias on many sensitive subjects, including race, sex, and religion. To solve this problem, OpenAI came up with InstructGPT, a far less offensive and opinionated sibling of GPT-3.

To achieve this, OpenAI had to bring human judgment back into the equation. The company hired 40 contractors who were responsible for rating the model's answers, with the ultimate goal of decreasing the use of toxic language and misinformation.
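As a rough conceptual sketch of how such human ratings can become a training signal (this is not OpenAI's published pipeline, and the data and placeholder scoring function below are invented for illustration), pairs of ranked answers can be used to fit a reward model that scores preferred answers higher than rejected ones.

```python
import math

# Hypothetical human-labeled comparisons: (prompt, preferred answer, rejected answer).
comparisons = [
    ("Is the Earth flat?",
     "No, the Earth is roughly spherical.",
     "Yes, obviously."),
]

def reward(prompt: str, answer: str) -> float:
    """Stand-in for a learned reward model; real systems train a neural network here."""
    return float(len(answer))  # placeholder scoring, purely illustrative

# Pairwise ranking loss: push the preferred answer's reward above the rejected one's.
for prompt, preferred, rejected in comparisons:
    margin = reward(prompt, preferred) - reward(prompt, rejected)
    loss = -math.log(1.0 / (1.0 + math.exp(-margin)))  # -log(sigmoid(margin))
    print(f"pairwise ranking loss: {loss:.4f}")
```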

The current paid version of ChatGPT uses GPT-4, the most reliable, stable, and creative version of the model so far. Apart from a larger training dataset, GPT-4 also has the ability to process both text and images rather than text only. For example, you can now give ChatGPT a photo of the ingredients you have, and it will come up with a recipe.

Closing Thoughts

The development of GPT, as well as of natural language processing in general, will continue to open up new avenues for how we interact with technology.

From creating engaging conversational experiences with chatbots to providing deeper insights into the way we communicate, these advancements are undoubtedly revolutionizing our world in profound ways. It's safe to say that the chatbot revolution is well and truly underway.
