INDEX
Explanations
references to numbers or figures, particularly repetitions of the number 800 at the highest activation level
instances of a specific numerical sequence, particularly variations of "800."
New Auto-Interp
Negative Logits
lict
-0.81
uked
-0.77
bid
-0.76
rely
-0.72
isot
-0.69
mast
-0.68
lying
-0.68
ributed
-0.67
handc
-0.66
shepherd
-0.66
POSITIVE LOGITS
800
1.08
700
0.95
989
0.86
MHz
0.85
600
0.84
888
0.83
æ©Ł
0.82
msec
0.81
400
0.80
889
0.80
Activations Density 0.012%