The entity typing dataset used in "Nonsymbolic Text Representation" (EACL 2017).