Call for Papers

NAACL-HLT 2012 Joint Workshop on

Automatic Knowledge Base Construction and

Web-scale Knowledge Extraction (AKBC-WEKEX)

June 7-8, 2012, Montreal, Canada

Recently, there has been a significant amount of interest in automatically creating large-scale knowledge bases (KBs) from unstructured text. The Web-scale knowledge extraction task presents a unique set of opportunities and challenges. The resulting knowledge bases can have the advantage of scale and coverage. They have been enriched by linking to the Semantic Web, in particular the growing Linked Open Data (LOD) cloud. These semantic knowledge bases have been used for a wide variety of Natural Language Processing, Knowledge Representation, and Reasoning applications such as semantic search, question answering, entity resolution, and ontology mapping. The automatic construction of these KBs has been enabled by research in areas including natural language processing, information extraction, information integration, databases, search, and machine learning. There are substantial scientific and engineering challenges in advancing and integrating these methodologies.

With this year’s workshop, we would like to build on the positive experiences of two previous workshops: AKBC-2010 and WEKEX-2011. The joint AKBC-WEKEX workshop will serve as a forum for researchers working in the area of automated knowledge harvesting from text. By hosting invited talks by leading researchers from industry, academia, and government, and by focusing particularly on vision papers, we aim to provide a lively forum for discussion about the field of automated knowledge base construction.

Topics of Interest

Topics of interest include, but are not limited to:

  • information integration; schema alignment; ontology alignment; ontology construction
  • monolingual alignment, alignment between knowledge bases and text
  • joint inference between text interpretation and knowledge base
  • pattern and semantic analysis of natural language, reading the web, learning by reading
  • scalable computation; distributed computation; probabilistic databases
  • information retrieval; search on mixtures of structured and unstructured data
  • machine learning; unsupervised, lightly-supervised and distantly-supervised learning; learning from naturally-available data
  • human-computer collaboration in KB construction; automated population of wikis
  • dynamic data, online/on-the-fly adaptation of knowledge
  • inference; scalable approximate inference
  • languages, toolkits and systems for automated knowledge base construction
  • demonstrations of existing automatically-built knowledge bases

Important Dates

  • April 6, 2012 (11:59pm SST [UTC -11 hours]): Papers due
  • April 24, 2012: Notification of acceptance
  • May 7, 2012: Camera ready due
  • June 7-8, 2012: Workshop

Submission Information

We welcome ongoing and exciting preliminary work. We are particularly interested in visionary paper submissions. We aim for papers that express intriguing and promising ideas — focusing less on where science is today and more on where it should go tomorrow.

Please format your papers using the two-column NAACL-HLT 2012 proceedings format, and restrict them to 4 pages (excluding references). Reviewing will be double-blind, so please remove author names and affiliations, as well as any self-references that might reveal author identities. All accepted papers will be presented as posters. Additionally, the top 3 papers, as determined by the PC, will be presented in an oral session.

Please submit your papers by April 6, 2012.

Invited Talks (all confirmed)

Nilesh Dalvi (Yahoo Research)
Bonnie Dorr (DARPA)
Oren Etzioni (UWashington)
James Fan & Aditya Kalyanpur (IBM Research)
Eduard Hovy (USC ISI)
Andrew McCallum (UMass)
Tom Mitchell (CMU)
Patrick Pantel (Microsoft Research)
Fernando Pereira (Google Research)
Chris Re (UWisconsin)
Steffen Staab (U Koblenz)

Organizing Committee

James Fan (IBM Research)
Raphael Hoffmann (UWashington)
Aditya Kalyanpur (IBM Research)
Sebastian Riedel (UMass Amherst)
Fabian Suchanek (MPI)
Partha Pratim Talukdar (CMU)

Steering Committee

Oren Etzioni (UWash)
Andrew McCallum (UMass)
Fernando Pereira (Google Research)
Gerhard Weikum (MPI)

Program Committee

Amol Deshpande (UMaryland)
Anhai Doan (UWisconsin Madison)
Tony Fader (UWash)
Zack Ives (UPenn)
Alon Halevy (Google Research)
Chris Re (UWisconsin Madison)
Alan Ritter (UWash)
Andrew McCallum (UMass)
Patrick Pantel (Microsoft Research)
Marius Pasca (Google Research)
Sunita Sarawagi (IIT Bombay)
Sameer Singh (UMass)
Gerhard Weikum (MPI)
Xiao Ling (UWash)
Limin Yao (UMass)
Ken Barker (IBM Research, US)
Alfio Gliozzo (IBM Research, US)
Soren Auer (University of Leipzig, Germany)
Peter Clark (Vulcan Inc, US)
Eduard Hovy (USC/ISI, US)
Vladimir Kolovski (Novartis, US)
Goran Nenadic (University of Manchester, UK)
Martin Theobald (Max-Planck-Institut für Informatik, Germany)
Oren Etzioni (University of Washington)