- Optimization for browsing versus search
- Hash table optimization
- Optimization for read versus write
- Optimization against collisions
- Labelled versus unlabelled
  - The latter (unlabelled) is more fluid
- Constraints in information theory
- Machine memory can’t hold too many word features at the same time
- Roughly 20% of words explain 80% of the clustering
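
A rough sketch of the last two points: cap the number of word features by keeping only the most frequent fraction of the vocabulary, then check how much of the text that fraction covers. The function name `top_fraction_vocabulary`, the sample sentences, and the use of raw frequency as a stand-in for "explains the clustering" are all assumptions made for illustration; the 80/20 split is a rule of thumb, not something this code proves.

```python
from collections import Counter


def top_fraction_vocabulary(documents, fraction=0.2):
    """Keep the most frequent `fraction` of distinct words.

    Returns the kept words and the share of all word occurrences they cover.
    Frequency is only a proxy here (an assumption); a real pipeline would
    likely drop stop words or weight terms before clustering.
    """
    # Count every whitespace-separated, lowercased word across all documents.
    counts = Counter(word for doc in documents for word in doc.lower().split())
    if not counts:
        return [], 0.0
    # Number of distinct words to keep, never fewer than one.
    keep = max(1, int(len(counts) * fraction))
    kept = counts.most_common(keep)
    coverage = sum(count for _, count in kept) / sum(counts.values())
    return [word for word, _ in kept], coverage


# Hypothetical toy corpus, only to show the call.
docs = [
    "the cat sat on the mat",
    "the dog sat on the log",
    "the cat and the dog",
]
vocab, coverage = top_fraction_vocabulary(docs)
print(vocab, round(coverage, 2))
```

On a real corpus, the share covered by the top 20% depends on how skewed the word distribution is, so it is worth measuring `coverage` rather than assuming 80%.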