INDEX
Explanations
references to structures like garages, houses, and dormitories
words related to structures and places
New Auto-Interp
Negative Logits
compe
-0.73
neut
-0.70
streng
-0.65
ás
-0.65
SEN
-0.64
Refer
-0.63
Contrast
-0.62
Moderate
-0.61
Lif
-0.61
Childhood
-0.60
POSITIVE LOGITS
lain
1.02
hare
0.99
rats
0.93
velt
0.84
itory
0.81
rooms
0.78
pole
0.78
maid
0.76
itual
0.74
cloth
0.74
Activations Density 0.226%