Text this: Compression Schemes for Mining Large Datasets