INDEX
Explanations
words related to physical or abstract spaces
mentions of physical space or spatial concepts
Negative Logits
vengeance   -0.76
venge       -0.74
usted       -0.73
grades      -0.70
inger       -0.69
otine       -0.66
ctive       -0.66
iamond      -0.65
igers       -0.65
Simpson     -0.64
Positive Logits
Layout      0.89
occupied    0.88
spaces      0.85
vacated     0.85
space       0.83
shuttle     0.80
occupancy   0.80
Spaces      0.78
dayName     0.77
bars        0.75
Activations Density: 0.039%