INDEX
Explanations
the word "spot" followed by a number indicating intensity
instances of the word "spot" in various contexts
Negative Logits
issance      -0.89
yss          -0.72
rall         -0.71
RR           -0.71
adolesc      -0.68
aughter      -0.64
Strait       -0.64
godd         -0.59
ãĥ¼ãĥĨãĤ£    -0.59
ransom       -0.58
Positive Logits
lights       1.28
eele         0.91
spot         0.89
ting         0.88
lighting     0.87
spots        0.85
codes        0.84
ter          0.81
bos          0.81
light        0.80
Activations Density 0.020%