INDEX
Explanations
the names of actors starring in films
instances of the word "in" used in various contexts
New Auto-Interp
Negative Logits
llor
-0.82
incumb
-0.72
bryce
-0.69
soever
-0.68
akia
-0.68
inconven
-0.67
incl
-0.65
killed
-0.65
defect
-0.65
ileaks
-0.64
POSITIVE LOGITS
lieu
1.33
conjunction
1.17
clus
1.16
accordance
1.07
spite
1.07
regards
1.05
unison
1.05
animate
1.02
addition
1.00
disguise
1.00
Activations Density 0.562%