Skip to content

A two-part workshop series collecting posts from Reddit and applying a range of text analytic techniques

Notifications You must be signed in to change notification settings

QUT-Digital-Observatory/a_glamorous_intro_to_text_analytics

Repository files navigation

A Glamorous Introduction to Text Analytics

In this two-part workshop series, you will learn to collect posts from Reddit - using the very sparkly example of the Eurovision Song Contest - and apply a range of text analytic techniques to them. Some experience with R coding preferred, no experience with social media data collection or text analysis required.

Running locally

Development and testing was done with Python version 3.12 and R version 4.4.

Setup using Anaconda and RStudio

TO WRITE

Setup using shell / command line interface

These instructions assume a little familiarity with installing and running Python packages from the command line, and that Python and R are already installed. All shell commands provided are run from this repository root directory.

We recommend creating and activating a new Python virtual environment.

Install Python requirements:

pip install -r python_requirements.txt

Open an R shell with:

R

Install R notebook kernel:

install.packages('IRkernel')
IRkernel::installspec()

To quit the R shell, use q() or the keyboard shortcut Ctrl+D (Cmd+D on Mac).

You may now as desired run individual notebooks with:

jupyter notebook 01_data_collection/explore.ipynb

Or you may as desired run a Jupyter Lab server for the whole project with:

jupyter lab

Development

For additional setup steps for contributing code to this repository, see CONTRIBUTING.md.

Authors and license

Code and documentation by QUT Digital Observatory and the Language Technology and Data Analysis Laboratory (LADAL) and licensed under CC BY 4.0

Support provided by the Language Data Commons of Australia (LDACA) (and therefore ARDC) and the QUT Library.

About

A two-part workshop series collecting posts from Reddit and applying a range of text analytic techniques

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published