dm.cs.tu-dortmund.de/en/mlbits/text-mining-vector-space-model/
Vector Space Model – Lecture Notes
Normalize words: birds → bird, gets → get. Discard the punctuation marks "." and ",".
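The normalization step can be sketched in a few lines of Python. This is only a minimal illustration: the lookup table `NORMAL_FORMS` and the function name `normalize` are hypothetical stand-ins for a real stemmer or lemmatizer, the lowercasing is an added assumption, and the example sentence is chosen for illustration rather than taken from the notes.

```python
# Hypothetical lookup table standing in for a real stemmer or lemmatizer;
# it contains only the two mappings named in the notes.
NORMAL_FORMS = {"birds": "bird", "gets": "get"}


def normalize(text):
    """Split text into words, discard '.' and ',', and map words to normal forms.

    Lowercasing is an assumption added here; the notes do not mention it.
    """
    # Replace the two discarded punctuation marks with spaces before splitting.
    cleaned = text.lower().replace(".", " ").replace(",", " ")
    # Words without an entry in the lookup table are kept unchanged.
    return [NORMAL_FORMS.get(word, word) for word in cleaned.split()]


tokens = normalize("Birds of a feather flock together.")
print(tokens)  # ['bird', 'of', 'a', 'feather', 'flock', 'together']
```

A dictionary lookup keeps the sketch transparent; in practice a rule-based stemmer or a lemmatizer would likely replace it.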
Document-Term Matrix
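A document-term matrix assigns one dimension (column) to each distinct term and one row of term counts to each document. The following is a rough sketch under stated assumptions: the function name `document_term_matrix` and the two tokenized example documents are hypothetical, and the code numbers dimensions from 0 whereas the table numbers them from 1.

```python
from collections import Counter


def document_term_matrix(documents):
    """Build (vocabulary, matrix) from a list of already normalized token lists."""
    # One dimension per distinct term, in alphabetical order.
    vocabulary = sorted({term for doc in documents for term in doc})
    index = {term: dim for dim, term in enumerate(vocabulary)}

    matrix = []
    for doc in documents:
        row = [0] * len(vocabulary)
        # Each entry holds how often the term occurs in this document.
        for term, count in Counter(doc).items():
            row[index[term]] = count
        matrix.append(row)
    return vocabulary, matrix


# Hypothetical example documents, already tokenized and normalized.
docs = [
    ["the", "early", "bird", "get", "the", "worm"],
    ["bird", "of", "a", "feather", "flock", "together"],
]
vocabulary, matrix = document_term_matrix(docs)

for term, column in zip(vocabulary, zip(*matrix)):
    print(term, column)
```

Sorting the vocabulary gives a reproducible term-to-dimension assignment, so every document vector uses the same column order.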
Dim   1  2    3    4     5    6       7      8        9      10 to 26
Term  a  and  bed  bird  but  cheese  early  feather  flock  [...]
[...] experiments - part 1. Inf. Process. Manage. 36, 6 (2000), 779–808. DOI: 10.1016/S0306-4573(00)00015-7
[SpWaRo00b]
Spärck Jones, K., Walker, S. and Robertson, S.E. 2000. A probabilistic model of information …