INDEX
Explanations
adverbs ending in 'ly'
New Auto-Interp
Negative Logits
ilater
-0.83
icket
-0.74
unta
-0.70
ngth
-0.69
Mickey
-0.68
ococ
-0.68
BAT
-0.67
Jewel
-0.66
leground
-0.65
eers
-0.64
POSITIVE LOGITS
speaking
0.89
entimes
0.80
tics
0.76
few
0.76
than
0.70
suffice
0.66
zed
0.65
translated
0.64
adv
0.64
sized
0.64
Activations Density 0.032%