INDEX
Explanations
mentions of physical space or spatial concepts
references to physical space or capacity
New Auto-Interp
Negative Logits
usted
-0.77
iamond
-0.77
ctive
-0.74
otine
-0.69
ku
-0.68
cause
-0.68
tl
-0.67
venge
-0.66
kai
-0.66
decap
-0.64
POSITIVE LOGITS
space
0.93
spaces
0.90
shuttle
0.81
constraints
0.79
habitats
0.76
bars
0.75
occupied
0.74
kat
0.73
Layout
0.73
ngth
0.72
Activations Density 0.028%