repositories
Public datasets available through my work at ARL and UCSC.
- ARL Creative Visual Storytelling Anthology is a collection of 100 author responses to an improved creative visual storytelling exercise, collected on Amazon Mechanical Turk. Each item contains four facet entries, corresponding to Entity, Scene, Narrative, and Title.
- ARL SCOUT is the Situated Corpus Of Understanding Transactions, a multi-modal collection of human-robot dialogue in the task domain of collaborative exploration. The corpus was constructed from multi-phased Wizard-of-Oz experiments in which human participants gave verbal instructions to a remotely located robot to move and gather information about its surroundings.
- PersonaBank is a collection of 108 personal stories from weblogs that have been annotated with their Story Intention Graphs, a deep representation of the fabula of a story.
- Sarcasm Corpus V1 is a subset of the Internet Argument Corpus, including response text from quote-response pairs annotated for sarcasm.
- Persuasion and Personality Corpus is a subset of user-generated dialogs from the Internet Argument Corpus for exploring the role of affect in persuasive arguments. It includes 637 subjects profiled for the Big Five personality traits and prior beliefs about socio-political issues, along with the subjects’ responses after exposure to user-generated factual vs. emotional dialogic exchanges, compared with the belief change produced by balanced, curated arguments.