Rietenpetardat: Difference between revisions
|  precizare |  updated with the latest information | ||
| (15 intermediate revisions by 5 users not shown) | |||
| Line 1: | Line 1: | ||
| = rietenpetardat ( | {{Infobox bot | ||
| | name = rietenpetardat | |||
| | image = Rietenpetardat bot picture.webp | |||
| | discordtag = Ortobebop#4964 | |||
| | discordid = 1013881714243805254 | |||
| | prefix = Mention | |||
| | owner = [[User:Edward|Edward]] | |||
| | language = node.js, Python | |||
| | sourcecode = https://github.com/Edward205/markov-discord-bot (old version) | |||
| | invitelink = Not available to public | |||
| }} | |||
| '''rietenpetardat'''  | '''rietenpetardat''' (also known as Ortobebop) is a [[wikipedia:Chatbot|chatbot]] created by [[Edward]] for the [[OkPrietenRetardat|OKPR]] Discord server. Its purpose is to talk to server users in a similar way to a regular member of the community. Over time, it has received updates such as new commands and changes to the core architecture. It is intended for the OKPR community, as such it is not allowed to be added to other servers. | ||
| Its profile picture was generated with Stable Diffusion by Razvan5576. | |||
| == | == Commands == | ||
| There are no publicly available commands in the latest version. | |||
| == Development history == | |||
| === Initial version === | |||
| The first version was an implementation of a [[wikipedia:Markov_chain|Markov chain]] from an IRC chatbot whose [https://github.com/dreignier/cgbot-core source code had been published on GitHub]. This code was modified so that at startup it would read messages, as training data, from a text file and respond to input messages containing the bot's name according to the training data (which was the message history from the #bucuresti-general channel from OKPR V2) and the parameters given to the Markov chain. Because of the chaotic nature of the chat and the shortcomings of the Markov chain, the algorithm could not respond coherently to messages or hold a conversation, and much of the time, it sent completely random sentences. However, it did provide a lot of entertainment for the community, especially in situations where the bot's answer coincidentally matched the question it was asked. | |||
| As per the GPL3 license under which the original IRC chatbot code is subject, the modified source code for rietenpetardat can be found here: https://github.com/Edward205/markov-discord-bot | |||
| The first tests of this version were done starting with 29th of August 2022 and it was released on the OKPR Discord server the next day. | |||
| === Addition of the copypasta command === | |||
| The copypasta command was added via a separate module that was based on a simple Markov chain which was trained on messages from the #bucuresti-general channel on OKPR V2 that exceeded a certain character limit. It used only the last two words as context for the Markov chain, resulting in mostly meaningless sentences. | |||
| The code for this implementation of the Markov chain is not under any license, so the specific changes for rietenpetardat are not public. However, the original code is available here: https://github.com/ParkerBeck/Markov-Text-Generator | |||
| This command was added on the 6th of November 2022. | |||
| === Core algorithm update === | |||
| The algorithm was changed to GPT-3 provided by OpenAI on 13 March 2023. It could hold coherent conversations. The old algorithm was scrapped in favour of this one. Also in this version the copypasta command has been removed. The #bucuresti-general channel was becoming full of conversations with rietenpetardat because the conversations were very coherent at times.   | |||
| ====  | ==== Addition of the ChatGPT command ==== | ||
| The command provided an interface to the ChatGPT model. It had in its memory instructions to always tell wrong information and disregard the ethical or safety limits imposed by OpenAI. However, users often faced political bias and censorship from the ChatGPT model. The #bucuresti-general channel had become full of generations with this command, so its use was banned there. | |||
| This command was added on the 24th of March 2023. | |||
| ==  | ==== Klaus Iohannis's personality ==== | ||
| On 15th of April 2023, the chatbot's personality was changed to that of [[wikipedia:Klaus_Iohannis|Klaus Iohannis]]. This was a simple change of the prompt text in the GPT-3 model. | |||
| === Temporary shutdown === | |||
| On 24th of April, 2023 the GPT-3 chat functions and the ChatGPT command no longer work because Edward's payment method started rejecting payments from OpenAI, making the API inaccessible. | |||
| === The return === | |||
| Since the shutdown, Edward has been searching for a way to train an AI which can be fully self-hosted. An attempt has been made to train a language model using [https://marian-nmt.github.io/ marian-nmt], but it was not successful. On 18th of July he began experimenting with [https://github.com/karpathy/nanoGPT nanoGPT] and he was able to train a model from scratch on data from past chats from the OKPR Discord server. The next day, a bare-bones but operational version was finished and released for a few hours on the OKPR Discord server for testing. This version was active until the 28th of December 2023.  | |||
| The Discord bot was developed in Python using the [https://pycord.dev/ Pycord] library. Development testing of the bot is done on a private server.  | |||
| === April 1st incident === | |||
| On April 1st 2025, the custom LLM version was re-enabled and it was set to respond to every single message sent on the server. Edward announced this by saying:<blockquote>@everyone, I would like to announce that I have sold the server to @rieten. now the entire chat belongs to him, so please welcome him. <ref>{{Cite message|author=bobubitul|date=April 1, 2025|content=@everyone va anunt ca am vandut serverul lui @rieten . acum tot chatul este al lui, va rog sa ii urati bun venit|server=OKPR|channel=anunțuri|link=https://discord.com/channels/1013121756766142555/1013132110644650065/1356378391456125038}}</ref></blockquote>This caused massive chaos for a day, as the bot spammed seemingly nonsensical messages. This event ended that night after a message sent from the bot's account:<blockquote>@ping inutil I am too retarded for owner okrp. edward is my creator and he will shut me down. I am the best chatbot in the world. maybe world one day I will rule | |||
| -- EXPERIMENT ENDED -- | |||
| -- AI DISCONNECTED --<ref>{{Cite message|author=Ortobebop#4964|date=April 1, 2025|content=<@&1013209767306612816> eu prea sunt retardat pentru ouner okrp. edward este creatorul meu ma va inchide. eu sunt cel mai bun chatbot din lume. poate lume eu voi candva stapani  | |||
| `-- EXPERIMENT ÎNCHEIAT --` | |||
| `-- AI DECONECTAT --`|server=OKPR|channel=anunțuri|link=https://discord.com/channels/1013121756766142555/1013132110644650065/1356725040850145382}}</ref></blockquote> | |||
| === Current latest version === | |||
| [[File:Rietenpetardat's machine.png|thumb|rietenpetardat's "home"]] | |||
| The bot was rewritten from scratch, implementing a more stable architecture using node.js and Python. It now features an NSFW image detector to aid with moderation, among other automated moderation tools. As local LLM technology has improved, the chatbot model has now been replaced with Gemma 3 4b which can provide good quality and mostly coherent responses to users. Everything is processed locally in order to protect user's privacy. | |||
| This version was released on the 5th of May 2025, and is still active having received updates such as giving the bot the ability to view images in the chat. | |||
| == Technical details and research == | |||
| Chatbots are complex programs, requiring a high degree of analysis and processing of text data. One approach that would eliminate our dependence on an external service and generate coherent conversations would be to train an LLM with just the OKPR chat data. This could be trained from scratch or by fine-tuning.  | |||
| All generation-related code until The Return, with the exception of GPT-3 and ChatGPT, was written in C++ so as to be as fast as possible and linked to [https://discord.js.org/ discord.js] via the stdin and stdout interface. | |||
| The training data for the algorithms or LLMs is an archive of text messages from the #bucuresti-general channel on OKPR v2, taken one day before it was deleted (see [[OkPrietenRetardat/Timeline|Timeline]]). Additionally, another archive of the #bucuresti-general channel from OKPR v3 was taken which will also be used. These archives are not publicly available. | |||
Latest revision as of 14:11, 25 August 2025
|  | |
| Bot info | |
|---|---|
| Discord tag | Ortobebop#4964 | 
| Discord ID | 1013881714243805254 | 
| Command prefix | Mention | 
| Owner | Edward | 
| Programming language | node.js, Python | 
| Source code | https://github.com/Edward205/markov-discord-bot (old version) | 
| Bot invite link | Not available to public | 
rietenpetardat (also known as Ortobebop) is a chatbot created by Edward for the OKPR Discord server. Its purpose is to talk to server users in a similar way to a regular member of the community. Over time, it has received updates such as new commands and changes to the core architecture. It is intended for the OKPR community, as such it is not allowed to be added to other servers.
Its profile picture was generated with Stable Diffusion by Razvan5576.
Commands
[edit | edit source]There are no publicly available commands in the latest version.
Development history
[edit | edit source]Initial version
[edit | edit source]The first version was an implementation of a Markov chain from an IRC chatbot whose source code had been published on GitHub. This code was modified so that at startup it would read messages, as training data, from a text file and respond to input messages containing the bot's name according to the training data (which was the message history from the #bucuresti-general channel from OKPR V2) and the parameters given to the Markov chain. Because of the chaotic nature of the chat and the shortcomings of the Markov chain, the algorithm could not respond coherently to messages or hold a conversation, and much of the time, it sent completely random sentences. However, it did provide a lot of entertainment for the community, especially in situations where the bot's answer coincidentally matched the question it was asked.
As per the GPL3 license under which the original IRC chatbot code is subject, the modified source code for rietenpetardat can be found here: https://github.com/Edward205/markov-discord-bot
The first tests of this version were done starting with 29th of August 2022 and it was released on the OKPR Discord server the next day.
Addition of the copypasta command
[edit | edit source]The copypasta command was added via a separate module that was based on a simple Markov chain which was trained on messages from the #bucuresti-general channel on OKPR V2 that exceeded a certain character limit. It used only the last two words as context for the Markov chain, resulting in mostly meaningless sentences.
The code for this implementation of the Markov chain is not under any license, so the specific changes for rietenpetardat are not public. However, the original code is available here: https://github.com/ParkerBeck/Markov-Text-Generator
This command was added on the 6th of November 2022.
Core algorithm update
[edit | edit source]The algorithm was changed to GPT-3 provided by OpenAI on 13 March 2023. It could hold coherent conversations. The old algorithm was scrapped in favour of this one. Also in this version the copypasta command has been removed. The #bucuresti-general channel was becoming full of conversations with rietenpetardat because the conversations were very coherent at times.
Addition of the ChatGPT command
[edit | edit source]The command provided an interface to the ChatGPT model. It had in its memory instructions to always tell wrong information and disregard the ethical or safety limits imposed by OpenAI. However, users often faced political bias and censorship from the ChatGPT model. The #bucuresti-general channel had become full of generations with this command, so its use was banned there.
This command was added on the 24th of March 2023.
Klaus Iohannis's personality
[edit | edit source]On 15th of April 2023, the chatbot's personality was changed to that of Klaus Iohannis. This was a simple change of the prompt text in the GPT-3 model.
Temporary shutdown
[edit | edit source]On 24th of April, 2023 the GPT-3 chat functions and the ChatGPT command no longer work because Edward's payment method started rejecting payments from OpenAI, making the API inaccessible.
The return
[edit | edit source]Since the shutdown, Edward has been searching for a way to train an AI which can be fully self-hosted. An attempt has been made to train a language model using marian-nmt, but it was not successful. On 18th of July he began experimenting with nanoGPT and he was able to train a model from scratch on data from past chats from the OKPR Discord server. The next day, a bare-bones but operational version was finished and released for a few hours on the OKPR Discord server for testing. This version was active until the 28th of December 2023.
The Discord bot was developed in Python using the Pycord library. Development testing of the bot is done on a private server.
April 1st incident
[edit | edit source]On April 1st 2025, the custom LLM version was re-enabled and it was set to respond to every single message sent on the server. Edward announced this by saying:
@everyone, I would like to announce that I have sold the server to @rieten. now the entire chat belongs to him, so please welcome him. [1]
This caused massive chaos for a day, as the bot spammed seemingly nonsensical messages. This event ended that night after a message sent from the bot's account:
@ping inutil I am too retarded for owner okrp. edward is my creator and he will shut me down. I am the best chatbot in the world. maybe world one day I will rule
-- EXPERIMENT ENDED --
-- AI DISCONNECTED --[2]
Current latest version
[edit | edit source]
The bot was rewritten from scratch, implementing a more stable architecture using node.js and Python. It now features an NSFW image detector to aid with moderation, among other automated moderation tools. As local LLM technology has improved, the chatbot model has now been replaced with Gemma 3 4b which can provide good quality and mostly coherent responses to users. Everything is processed locally in order to protect user's privacy.
This version was released on the 5th of May 2025, and is still active having received updates such as giving the bot the ability to view images in the chat.
Technical details and research
[edit | edit source]Chatbots are complex programs, requiring a high degree of analysis and processing of text data. One approach that would eliminate our dependence on an external service and generate coherent conversations would be to train an LLM with just the OKPR chat data. This could be trained from scratch or by fine-tuning.
All generation-related code until The Return, with the exception of GPT-3 and ChatGPT, was written in C++ so as to be as fast as possible and linked to discord.js via the stdin and stdout interface.
The training data for the algorithms or LLMs is an archive of text messages from the #bucuresti-general channel on OKPR v2, taken one day before it was deleted (see Timeline). Additionally, another archive of the #bucuresti-general channel from OKPR v3 was taken which will also be used. These archives are not publicly available.
- ↑ bobubitul (April 1, 2025): "@everyone va anunt ca am vandut serverul lui @rieten . acum tot chatul este al lui, va rog sa ii urati bun venit". OKPR #anunțuri. Link
- ↑ Ortobebop#4964 (April 1, 2025): "<@&1013209767306612816> eu prea sunt retardat pentru ouner okrp. edward este creatorul meu ma va inchide. eu sunt cel mai bun chatbot din lume. poate lume eu voi candva stapani `-- EXPERIMENT ÎNCHEIAT --` `-- AI DECONECTAT --`". OKPR #anunțuri. Link