INDEX
Explanations
words related to specific locations, such as "Oswalt" and "Islington."
references to specific individuals and locations
Negative Logits
Token        Logit
Frozen       -0.77
Illusion     -0.76
Masquerade   -0.75
Apocalypse   -0.73
Hawaiian     -0.72
vape         -0.71
ĸļ           -0.70
Cinderella   -0.70
illac        -0.69
Crunch       -0.68
Positive Logits
Token        Logit
Osw          2.90
lington      1.53
orthy        1.51
Yar          1.37
Berman       1.29
abilia       1.24
Rutherford   1.10
Thames       1.09
leftist      1.02
Torres       0.98
Activations Density 0.062%
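
The density figure above can be read as the share of token positions on which the feature fires. The sketch below shows one way such a number might be computed. It assumes density means the fraction of activations strictly above zero, which this page does not state. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and chosen only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of activations strictly above `threshold`.

    Assumption: "density" means the share of token positions where the
    feature is active. The dashboard itself does not define the term.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 10,000 token positions, 6 of which fire.
sample = [0.0] * 9994 + [1.2, 0.4, 3.1, 0.9, 2.2, 0.1]
density = activation_density(sample)
print(f"{density:.3%}")
```

A sparse feature like this one, which responds mainly to a handful of name and place-name tokens, would be expected to fire on only a small fraction of positions.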