Learning Structure and Schemas from Documents