INDEX
Explanations
mentions of specific locations or points of interest
occurrences of the word "spot."
New Auto-Interp
Negative Logits
issance
-0.91
yss
-0.71
perty
-0.66
pend
-0.65
wake
-0.64
Strait
-0.63
anwhile
-0.62
jri
-0.62
godd
-0.61
velt
-0.61
POSITIVE LOGITS
lights
1.49
ter
1.05
light
1.01
ters
0.98
lighting
0.98
ting
0.97
ty
0.94
pots
0.89
tery
0.87
kick
0.85
Activations Density 0.033%