INDEX
Explanations
mentions of specific names, titles, and places
numerical rankings and statistics
Negative Logits
.�         -0.88
.</        -0.83
.).        -0.82
?).        -0.78
goddamn    -0.77
().        -0.74
darn       -0.74
.–         -0.70
.*         -0.69
.?         -0.69
Positive Logits
recognise    0.96
analyse      0.90
organise     0.89
neighbour    0.88
alys         0.88
organisers   0.87
oeuv         0.86
realise      0.86
manoeuv      0.85
organising   0.85
Activations Density 2.254%