Conférence
Notice

ImageGraph: A Visual Programming Language for the Visual Digital Humanities

Réalisation : 20 octobre 2020 - Mise en ligne : 1 septembre 2021
  • document 1 document 2 document 3
  • niveau 1 niveau 2 niveau 3
Descriptif

One of the greatest barriers to entry in the developed use of machine learning and computer vision by humanities scholars is the extensive technical skill required to program in computer vision frameworks (e.g. Tensorflow, pyTorch, Caffe); which can be challenging even for skilled data scientists and Python programmers. The recent move towards "drag-and-drop" interfaces for computer vision (e.g. the freemium application RunwayML) goes some way to increasing the accessibility of pretrained networks; but such interfaces generally do not allow users to train new models for their own research, or to manipulate and combine different models in sophisticated pipelines.

Taking inspiration from the sophisticated visual graph-based toolsets of contemporary music (e.g. PureData), ImageGraph is an open-source Visual Programming Language that tries to make computer vision (and other forms of machine learning) more accessible. Users write a computer program through a drag-and-drop graphical interface. This computer program is then compiled into Python and Tensorflow, uploaded automatically to GitHub (including with version control), and run on the Cloud in Google Colab. ImageGraph thus lowers the barriers to the use of computer vision and machine learning in terms of both technical ability and hardware requirements; and extensions for natural language processing and general data science are in active development.

Leonardo Impett (University of Durham)

Vidéo MemoRekall

Thème
Discipline :

Dans la même collection