Building Blocks for Data-Driven Theories of Language Understanding