INDEX
Explanations
superlatives, especially the suffix "-est."
occurrences of the word "pedestrian" and its variations
Negative Logits
perty     -0.77
veyard    -0.73
shock     -0.72
tremend   -0.70
ppo       -0.69
chwitz    -0.68
senal     -0.68
pton      -0.66
hammer    -0.64
sterdam   -0.64
Positive Logits
ruct      0.99
imating   0.97
imates    0.97
imate     0.95
ream      0.94
alker     0.93
reet      0.90
imated    0.85
osterone  0.83
eal       0.82
Activations Density: 0.026%