Explanations
numbers with high activations occurring in a numeric order or sequence
numeric identifiers or ratings associated with entities
Negative Logits
holder       -0.79
istically    -0.71
form         -0.68
think        -0.68
leck         -0.67
owicz        -0.66
snipp        -0.66
ging         -0.66
estine       -0.65
Sands        -0.64
Positive Logits
mph          0.98
ILCS         0.95
dB           0.90
00000        0.86
rup          0.82
20439        0.81
dB           0.81
508          0.81
503          0.78
é¾           0.77
Activations Density 0.023%