This timeline visualises the prevalence of topics over time in a temporally extended corpus, as well as the prominence of the topics in the texts.
The following exemplifies the timeline visualisation for a corpus consisting of Swedish news about diabetes. The y-axis shows 24 topics that have been automatically extracted from the corpus, and the x-axis shows the date associated with each text in the corpus. Each text is represented by a vertical line. The circles represent the level of association between the topic and the text. The larger the circle, the closer the association. Many overlapping circles at a certain date indicate that many texts on this topic were published on that date.
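The layout described above can be sketched with numpy and matplotlib. This is a minimal illustration under stated assumptions, not the code in this repository: the toy dates, the `strengths` matrix, the topic labels and the function name `draw_timeline` are all hypothetical.

```python
import datetime as dt

import matplotlib
matplotlib.use("Agg")  # render off-screen, so no display is needed
import matplotlib.dates as mdates
import matplotlib.pyplot as plt
import numpy as np

# Hypothetical toy data: one timestamp per text, and a (texts x topics)
# matrix holding the strength of association between each text and topic.
dates = [
    dt.datetime(2020, 1, 5),
    dt.datetime(2020, 1, 5),
    dt.datetime(2020, 2, 10),
    dt.datetime(2020, 3, 1),
]
strengths = np.array([
    [0.8, 0.0, 0.1],
    [0.5, 0.3, 0.0],
    [0.0, 0.9, 0.2],
    [0.0, 0.0, 0.7],
])
topic_labels = ["Topic 1", "Topic 2", "Topic 3"]


def draw_timeline(dates, strengths, topic_labels, max_marker_area=400.0):
    """Draw topics on the y-axis, dates on the x-axis, one line per text."""
    fig, ax = plt.subplots(figsize=(8, 3))
    x = mdates.date2num(dates)
    n_topics = strengths.shape[1]

    # One thin vertical line per text, spanning all topic rows.
    ax.vlines(x, -0.5, n_topics - 0.5, colors="lightgrey", linewidths=0.5)

    # One circle per (text, topic) pair with a non-zero association;
    # the marker area grows with the strength of the association.
    text_idx, topic_idx = np.nonzero(strengths)
    sizes = strengths[text_idx, topic_idx] * max_marker_area
    circles = ax.scatter(x[text_idx], topic_idx, s=sizes,
                         alpha=0.4, edgecolors="none")

    ax.set_yticks(range(n_topics))
    ax.set_yticklabels(topic_labels)
    ax.xaxis_date()       # show the numeric x values as dates
    fig.autofmt_xdate()   # rotate the date labels for readability
    return fig, ax, circles


fig, ax, circles = draw_timeline(dates, strengths, topic_labels)
fig.savefig("timeline_sketch.png", dpi=150)
```

Because the circles are semi-transparent, texts published on the same date and associated with the same topic show up as a darker, denser cluster, which is the overlap effect described above.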
It is possible to associate a unique hyperlink with each text that has been used for generating the timeline. The circles then become clickable links, which direct you to the web page associated with the text, for instance a web page that contains the text with its original layout. It is thereby possible to use the visualisation as a tool for locating and selecting potentially interesting texts for close reading.
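One way to obtain clickable circles with matplotlib is to attach a URL to each scatter marker and save the figure as SVG. The sketch below illustrates that general technique and is not necessarily how this repository implements it; the coordinates, marker sizes and `example.org` URLs are hypothetical placeholders.

```python
import io

import matplotlib
matplotlib.use("Agg")  # render off-screen, so no display is needed
import matplotlib.pyplot as plt

# Hypothetical data: x position (date), topic row and URL for three texts.
x_positions = [1, 2, 3]
topic_rows = [0, 1, 0]
urls = [
    "https://example.org/text-1",
    "https://example.org/text-2",
    "https://example.org/text-3",
]

fig, ax = plt.subplots()
circles = ax.scatter(x_positions, topic_rows, s=[300, 150, 80], alpha=0.4)

# Attach one URL per marker; the SVG backend wraps each marker in a link.
circles.set_urls(urls)

# Hyperlinks are only kept in output formats that support them, such as SVG.
buffer = io.BytesIO()
fig.savefig(buffer, format="svg")
svg_text = buffer.getvalue().decode("utf-8")
```

Writing `svg_text` to a file with an `.svg` extension and opening it in a web browser should make each circle a link to the page associated with its text.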
To create the visualisation, two topic modelling output files are needed, as shown in . In addition to the output from topic modelling, each text to be visualised needs an associated timestamp, e.g., a publication date. The output files can, for instance, be created from the output of the topic modelling tool Topics2Themes, by running the script:
An example of how to run the code (and what is needed for configuring the Topics2Themes tool in order to be able to create timelines) is given in .
Dependencies
The code uses numpy and matplotlib.
Acknowledgements
The work on topic-timelines has mainly been conducted within the project ActDisease, partly with support from the research infrastructures Huminfra and InfraVis.
ActDisease: Acting out Disease: How Patient Organizations Shaped Modern Medicine: ERC Starting Grant (ERC-2021-STG 101040999)
Huminfra: National infrastructure for Research in the Humanities and Social Sciences (Swedish Research Council, 2021-00176)
InfraVis: the Swedish National Research Infrastructure for Data Visualization (Swedish Research Council, 2021-00181)