This sub-report is part of a broader project aiming to map problematic narratives about the Russian-Ukrainian war since the beginning of the new phase of the Russo-Ukrainian war in February 2022. ‘Problematic narratives’ in this instance refer to the substance of persuasion campaigns, whether coordinated or organic and participatory.
We seek to identify these narratives to expose them as such and make them available to verification specialists as journalists, fact-checkers, disinformation researchers, human rights activists and educators. Typically these specialists work on single posts, producing fact-check’s or debunks of each. These single stories, however, may be part of larger narratives as described above.In this report, we present both the step by step we created to make possible the analysis of Telegram data employing the GarganText tool and the path followed to answer the questions focused specifically in Brazil, made during the first week of the Summer School.
Which strains of false information about the war in Ukraine emerge in Brazilian Telegram channels? Can we distill the main narratives?
How does a leading AI platform make sense of the main keywords taken from problematic channels and construct their most significant narratives?
The sequence of steps followed to develop the analysis are represented in Figure 1. First, relying on Telegram channels that spread problematic content about the Ukrainian war, we sourced additional channels by capturing their incoming and outgoing mentions and forwarding using TGSTAT ( tgstat.com). Using 4CAT to access the Telegram API, we downloaded the channels’ posts from February, 2022 to July, 2023.
The channel seeds and corresponding table were the following:
Seed channels
@guerranaucraniaultimasnoticias
@Grupos_Atualidades
@noticiasww
@ucraniavsrussia
@ucraniaerussia
Channel name | Channel Username | URL | Country |
ULTIMA HORA - SOS. | @UltimaHoraSOS | https://t.me/UltimaHoraSOS | BRA |
SELVA & AÇO | @selvaeaco | https://t.me/selvaeaco | BRA |
Randao | @randaoBR | https://t.me/randaoBR | BRA |
QUESTIONE-SE | @questioneseoficial | https://t.me/questioneseoficial | BRA |
Mundo PrintE | @Mundo_Printe | https://t.me/Mundo_Printe | BRA |
FreedomNews | @freedomnewsforyou | https://t.me/freedomnewsforyou | BRA |
Fora do Sistema | @EsquerdaFora | https://t.me/EsquerdaFora | BRA |
CONECTIUM ... Inscreva-se | @conectium | https://t.me/conectium | BRA |
Gafanhotos do Mercado - ONG - OB, Forex, B3, Crypto etc. TODO MATERIAL AQUI É GRATUITO E SEM FINS LUCRATIVOS | @canalgafanhotosdomercado | https://t.me/canalgafanhotosdomercado | BRA |
Brasilposting | @brasilposting | https://t.me/brasilposting | BRA |
FIM DOS TEMPOS PROFETIZADO | @batalhando_pela_fe_pr_alexandre | https://t.me/batalhando_pela_fe_pr_alexandre | BRA |
Random shit 🔌🔌 | @corvustt | https://t.me/corvustt | BRA |
GUERRA🚀RUSSIA/UCRÂNIA🗽 | @guerranaucraniaultimasnoticias | https://t.me/@guerranaucraniaultimasnoticias | BRA |
GUERRA RUSSIA X UCRÂNIA - ATUALIDADES | @Grupos_Atualidades | https://t.me/@Grupos_Atualidades | BRA |
Notícias da guerra e do mundo Rússia e Ucrânia | @noticiasww | https://t.me/@noticiasww | BRA |
🚨NEWS🚨 UCRÂNIA X RÚSSIA ‼️ | @ucraniavsrussia | https://t.me/@ucraniavsrussia | BRA |
‼️ Ucrânia AGORA ‼️ | @ucraniaerussia | https://t.me/@ucraniaerussia | BRA |
O INFORMANTE | https://t.me/@oinformantstarday | BRA | |
Canal Falando De Guerra | https://t.me/@falandodeguerra1 | BRA | |
Canal ARTE DA GUERRA | https://t.me/@canalartedaguerra | BRA | |
[deactivated] | @canalselvabrasiloficial | t.me/canalselvabrasiloficial | BRA |
Canal Visão Estratégica | @canalvisaoestrategica | t.me/canalvisaoestrategica | BRA |
Lucas Leiroz | @lucasleiroz | t.me/lucasleiroz | BRA |
Voz da Nova Resistência | @novaresistenciabrasil | t.me/novaresistenciabrasil | BRA |
Politicamente Incorreto I Love to Hate U 🔥🎡 | @polincbr | t.me/polincbr | BRA |
Semeando a discórdia | @SemeandoDiscordia | t.me/SemeandoDiscordia | BRA |
We developed an R script to compile the .csv files, rename its columns, create additional information, filter by keywords, clean the text to create a dataset, making it compatible with GarganText input by using tidyverse, readr and stringi packages. This code was also used by the other groups in the project that analyzed other countries' Telegramsphere. We subsequently loaded those posts in Gargantext, the text mining and analysis software, in order to group the terms and find narratives.
The code was based on the following sequence of steps:
#Team's country
Register the files’ country to help us to organize the code outputs before inserting them into GarganText.
#Set the working directory to your csv files source
Considering that some countries have more than one csv file to convert, the code input is the address of the folder with the csv files to compile before processing.
#Finding all csv files in the folder and using a function into all files found
An identifier registers all the csv files in the working directory and inserts it in an original function made to compile all csv files in an object called Data.
#Creating a function to adapt the column names to fit into the GarganText template.
We wrote a code able to:
Rename the columns of the csv outputs from 4CAT to fit the GarganText template, column names and order.
Creates specific and separated day (‘Publication Day’), month (‘Publication Month’) and year (‘Publication Year’) columns to fit GarganText.
Filter keywords related to the themes of interest listed by each country, considering a wordlist.
Clean the text to remove special characters that could interfere in GarganText working.
Remove columns other than Publication Day, Publication Month, Publication Year, Authors, Title, Source and Abstract.
Define the threshold of lines of the output considering GarganText limits.
Write a csv containing the country name and the date of the analysis.
#Produce a GarganText output
Finally, the output csv is compatible with GarganText to all countries.
The code was used by the Brazilian, Polish, Chinese, Italian and Indian teams, with minor adaptations made by the Brazilian team for each case.
Using Gargan Text and the Brazilian corpus, we composed a network of words and chose the ones with higher network degree to make prompting queries to ChatGPT to understand how it makes sense of these sets of keywords. After giving the context, we politely prompted:
"We are doing research in which we collected messages from Telegram channels that disseminate information and disinformation about the war in Ukraine, built a network with the most used terms and ran a modularity algorithm to calculate which cluster each node (term) belongs to. I will write the words of each cluster here and ask you to name the disinformation narratives that these terms represent. Is it okay?"
After promising to do its best, ChatGPT named the terms for each cluster.The network above was named after ChatGPT prompting results obtained after inserting each group of keywords in our prompt. Beyond naming, we also obtained a description of the content probably matching the keywords.
The topics resulting from prompts and the correspondent keywords are:
Supranational Intervention Narrative
Narrative: This narrative may suggest the idea that there is a supranational intervention or direction behind the events in Ukraine. The terms "centrobrics" and "supranational direction" can be used to imply the presence of hidden forces coordinating the situation and influencing the course of the conflict.
bad order hidden source
Words: Wave; dark picture; cryptic military communication; Russian losses; Ukrainian casualties (onda; quadro sombrio; comunicação militar enigmática; perdas russas; baixas ucranianas).
Neo-Ottoman Geopolitics and the Struggle for Influence
Narrative: This narrative may suggest that there is an ongoing geopolitical strategy involving Turkey, represented by the term "neo- Ottoman geopolitics." The narrative can emphasize the alleged Turkish ambition to expand its influence in the region and may use elements such as "Crimean Tatars" to promote this idea.
Words: BRICS, Ukraine migration, event, supranational board, submerged reality op, ‘great cauldron’ i.e. huge mixture (centrobrics ucrânia migração, evento, direção supranacional, realidade submersa op, grande caldeirão).
Narrative of Global Chaos and Conspiracy Narrative: This narrative may suggest that the war in Ukraine is just a part of a larger scenario of global collapse and chaos. The terms "collapse," "second assassination," "coup," and "vaccines" can be used to insinuate the existence of hidden and powerful forces behind these events. The narrative may claim that secret laboratories are conducting sinister research involving unimaginable secrets, such as election manipulation and the dissemination of vaccines for malicious purposes.Words: Russian borders, waves, Kurdish terrorists, neo-Ottomanist geopolitics, Crimean Tatars, own culture civilization, irks Russians, Syrian immigrants (Fronteiras russas, ondas, terroristas curdos, geopolítica neootomanista, tartaros crimeanos, própria civilização cultura, irrita russos, imigrantes sírios).
Viral Conspiracy Theories
Narrative: This narrative may suggest that a mysterious virus is being used as a weapon, affecting both Russian and Ukrainian military forces. Claims of a "wave" or sudden increase in disease cases can be used to reinforce this theory.
Words: collapse, humanity, coup, vaccines, election, second murder, laboratories did research, test, unimaginable secrets (colapso, humanidade, golpe, vacinas, eleição, segundo assassinato, laboratórios fizeram pesquisas, teste, segredos inimagináveis).
The network of narratives about the war that circulated in a sample of the Brazilian Telegram is presented in Figure 2. While the findings of the analyzed corpus align with the messages collected, the presence of mixed conspiracy theories within the Telegram groups posed a challenge for GarganText in effectively distinguishing and categorizing them within the network. Consequently, this blending of various theories diminished the significance of more specific and relatively less impactful false information, while elevating fringe conspiracy theories to the forefront.
Based on the relevant terms found this way, ChatGPT was able to trace back the narratives and produce labels compatible with the fringe content even using just a combination of a few keywords instead of the entire corpus inserted in Gargan Text. It is interesting that instead of corresponding to specific issues (like the pandemic or the war), the topics resulting from prompts bring different perspectives through which we could make sense of these events (if the pandemic is part of a conspiracy theory or just another example of global chaos and how the world is coming to an end; if the war is part of a supranational effort of known organizations or something made by hidden actors willing to secretly regain power).
Though generic, the topics correspond to the messages found on Telegram, and indicate an interesting way to generate hypotheses and trace back narratives assisted by generative A.I. based solely on keywords related to each content.
Gagolewski, M. and Tartanus, B. (2023). stringi: Fast and Portable Character String Processing Facilities. Available at: https://cran.r-project.org/web/packages/stringi/index.html.
Peeters, S. and Hagen, S. (2021). The 4CAT Capture and Analysis Toolkit: A Modular Tool for Transparent and Traceable Social Media Research. SSRN Scholarly Paper, Rochester, NY. doi:10.2139/ssrn.3914892.
Wickham, H. (2021). tidyverse: Easily Install and Load the ‘Tidyverse’. https://CRAN.R-project.org/package=tidyverse.
Wickham, H. et al. (2023). readr: Read Rectangular Text Data. https://cran.r-project.org/web/packages/readr/index.html.