INDEX
Explanations
instances of significant change or transformation
New Auto-Interp
Negative Logits
amina
-0.84
ç«
-0.68
ngth
-0.68
Ys
-0.64
Goodman
-0.64
Huck
-0.63
trak
-0.63
True
-0.62
zees
-0.62
mination
-0.61
POSITIVE LOGITS
drastically
0.95
radically
0.91
tack
0.89
gears
0.86
iating
0.86
dramatically
0.83
iations
0.80
effected
0.78
diapers
0.76
atile
0.75
Activations Density 1.700%