INDEX
Explanations
proper nouns related to Disney
references to the name "Walt" and related figures in a specific context
New Auto-Interp
Negative Logits
variance
-0.71
tampering
-0.69
elig
-0.63
orescent
-0.62
adesh
-0.61
aval
-0.61
linkage
-0.61
Cth
-0.60
RED
-0.59
extrater
-0.58
POSITIVE LOGITS
Disney
1.02
Whitman
1.02
rip
0.98
enthal
0.92
zing
0.89
stad
0.88
Walt
0.88
pins
0.81
ter
0.81
pin
0.78
Activations Density 0.010%