open source Archives – Public Data Lab

New article on GitHub and the platformisation of software development

Posted on December 8, 2023 | by Public Data Lab

An article on “The platformisation of software development: Connective coding and platform vernaculars on GitHub” by Liliana Bounegru has just been published in Convergence: The International Journal of Research into New Media Technologies.

The article is accompanied by a set of free tools for researching Github co-developed by Liliana with the Digital Methods Initiative – including to:

Extract the meta-data of organizations on Github
Extract the meta-data of Github repositories
Scrape Github for forks of projects
Scrape Github for user interactions and user to repository relations
Extract meta-data about users on Github
Find out which users contributed source code to Github repositories

The article is available open access here. The abstract is copied below.

This article contributes to recent scholarship on platform, software and media studies by critically engaging with the ‘social coding’ platform GitHub, one of the most prominent actors in the online proprietary and F/OSS (free and/or open-source software) code hosting space. It examines the platformisation of software and project development on GitHub by combining institutional and cultural analysis. The institutional analysis focuses on critically examining the platform from a material-economic perspective to understand how it configures contemporary software and project development work. It proposes the concept of ‘connective coding’ to characterise how software intermediaries such as GitHub configure, valorise and capitalise on public repositories, developer and organisation profiles. This institutional perspective is complemented by a case study analysing cultural practices mediated by the platform. The case study examines the platform vernaculars of news media and journalism initiatives highlighted by Source, a key publication in the newsroom software development space, and how GitHub modulates visibility in this space. It finds that the high-visibility platform vernacular of this news media and journalism space is dominated by a mix of established actors such as the New York Times, the Guardian and Bloomberg, as well as more recent actors and initiatives such as ProPublica and Document Cloud. This high-visibility news media and journalism platform vernacular is characterised by multiple F/OSS and F/OSS-inspired practices and styles. Finally, by contrast, low-visibility public repositories in this space may be seen as indicative of GitHub’s role in facilitating various kinds of ‘post-F/OSS’ software development cultures.

forestscapes listening lab at re:publica 23, Berlin, 5-7th June

Posted on June 4, 2023 | by Public Data Lab

As part of the forestscapes project we’re organising a listening lab at re:publica 23, the digital society festival in Berlin, 5-7th June 2023:

How can generative soundscape composition enable different perspectives on forests in an era of planetary crisis? The forestscapes listening lab explores how sound can serve as a medium for collective inquiry into forests as living cultural landscapes.

The soundscapes are composed with folders of sound from different sources, including field recordings from researchers, sound artists and forest practitioners, as well as online sounds from the web, social media and sound archives. They are composed using custom scripts with the open source supercollider software as well as open source norns device, a “sound machine for the exploration of time and space”.

The re:publica installation will include soundscapes from workshops in London and Berlin – including some new pieces from the Environmental Data, Media, and the Humanities hackathon last week.

Cross-posted from jonathangray.org.

Network exploration on the web: an interview with Gephi Lite

Posted on May 23, 2023 | by Public Data Lab

Following the recent release of Gephi Lite, an open-source web-based visual network exploration tool, we interviewed its developers about the background of the project, what they’ve done and future plans…

What is Gephi Lite?

Gephi Lite can actually be defined in two ways. The first definition follows the name we chose: Gephi Lite is a lighter version of the Gephi desktop software, targeting users who need to work on smaller networks with less complex operations in mind.

The second definition is more focused on the technical context: Gephi Lite is a serverless web application to drive visual network analysis. There are no more requirements than an internet connection and a modern web browser.

Continue reading →

“What actually happened? The use and misuse of Open Source Intelligence (OSINT)”, Digital Methods Winter School and Data Sprint 2023

Posted on October 27, 2022 | by Jonathan Gray

Applications are now open for the Digital Methods Winter School and Data Sprint 2023 which is on the theme of “What actually happened? The use and misuse of Open Source Intelligence (OSINT)”.

This will take place on 9-13th January 2023 at the University of Amsterdam. Applications are accepted until 1st December 2022.

More details and registration links are available here and an excerpt on this year’s theme and the format is copied below:

The Digital Methods Initiative (DMI), Amsterdam, is holding its annual Winter School on the ‘Use and Misuse of Open Source Intelligence (OSINT)’. The format is that of a (social media and web) data sprint, with tutorials as well as hands-on work for telling stories with data. There is also a programme of keynote speakers. It is intended for advanced Master’s students, PhD candidates and motivated scholars who would like to work on (and complete) a digital methods project in an intensive workshop setting. For a preview of what the event is like, you can view short video clips from previous editions of the School.
Continue reading →

Article on “Engaged research-led teaching: composing collective inquiry with digital methods and data”

Posted on July 11, 2022 | by Public Data Lab

A new article on “Engaged research-led teaching: composing collective inquiry with digital methods and data” co-authored by Jonathan Gray, Liliana Bounegru, Richard Rogers, Tommaso Venturini, Donato Ricci, Axel Meunier, Michele Mauri, Sabine Niederer, Natalia Sánchez-Querubín, Marc Tuters, Lucy Kimbell and Anders Kristian Munk has just been published in Digital Culture & Education.

The article is available here, and the abstract is as follows:

This article examines the organisation of collaborative digital methods and data projects in the context of engaged research-led teaching in the humanities. Drawing on interviews, field notes, projects and practices from across eight research groups associated with the Public Data Lab (publicdatalab.org), it provides considerations for those interested in undertaking such projects, organised around four areas: composing (1) problems and questions; (2) collectives of inquiry; (3) learning devices and infrastructures; and (4) vernacular, boundary and experimental outputs. Informed by constructivist approaches to learning and pragmatist approaches to collective inquiry, these considerations aim to support teaching and learning through digital projects which surface and reflect on the questions, problems, formats, data, methods, materials and means through which they are produced.

Make a deal with Gephisto

Posted on May 11, 2022 | by Public Data Lab

Mathieu Jacomy and Anders Munk, TANT Lab & Public Data Lab

6 minutes read

Gephisto is Gephi in one click. You give it network data, and it gives you a visualization. No settings. No skills needed. The dream! With a twist.

Gephisto produces visualizations such as the one above. It exists as a website, and you can just try it below. It includes test networks, you don’t even need one. Do it! Try it, and come back here. Then we talk about it.

https://jacomyma.github.io/gephisto/

Continue reading →

Introducing Memespector-GUI: A Graphical User Interface Client for Computer Vision APIs

Posted on October 27, 2021 | by Jason Chao

In this post Jason Chao, PhD candidate at the University of Siegen, introduces Memespector-GUI, a tool for doing research with and about data from computer vision APIs.

In recent years, tech companies started to offer computer vision capabilities through Application Programming Interfaces (APIs). Big names in the cloud industry have integrated computer vision services in their artificial intelligence (AI) products. These computer vision APIs are designed for software developers to integrate into their products and services. Indeed, your images may have been processed by these APIs unbeknownst to you. The operations and outputs of computer vision APIs are not usually presented directly to end-users.

The open-source Memespector-GUI tool aims to support investigations both with and about computer vision APIs by enabling users to repurpose, incorporate, audit and/or critically examine their outputs in the context of social and cultural research.

What kinds of outputs do these computer vision APIs produce? The specifications and the affordances of these APIs vary from platform to platform. As an example here is a quick walkthrough of some of the features of Google Vision API…

Continue reading →

Using ObservableHQ notebooks for gathering and transforming data in digital research

Posted on October 14, 2021 | by Tommaso Elli

We’ve recently been experimenting with the use of ObservableHQ notebooks for gathering and transforming data in the context of digital research. This post walks through a few recent examples of notebooks from recent Public Data Lab projects.

In one project we wanted to use the CrowdTangle “Links” API to fetch data about how certain web pages were shared online and across different platforms. After gaining access to relevant end points, we could adopt different means to call the APIs and retrieve data: such as using something like Postman (a general-purpose interface to call endpoints), or writing custom scripts (for example in Python or Javascript).

Code notebooks are a third option that lies somewhere in between these options. Designed for programmers, notebooks allow for iterative manipulation and experimentation with code whilst keeping track of creative processes by commenting on the thinking behind each step.

Notebooks allow us to both write and run custom scripts as well as creating simple interfaces for those who may not code. Thus we can use them to help researchers, students and external collaborators to collect data, making it easier to call APIs, setting parameters, or perform manipulations.

ObservableHQ is one solution for writing programming notebooks, it runs in the browser and is oriented towards data and visualisations (“We believe thinking with data is an essential skill for the future”). Hence, we thought it could be a good starting point for what we wanted to do.

Continue reading →

M	T	W	T	F	S	S
	1	2	3	4	5	6
7	8	9	10	11	12	13
14	15	16	17	18	19	20
21	22	23	24	25	26	27
28	29	30	31