INDEX
Explanations
comparisons using the word "like"
metaphors and comparisons in descriptions
Negative Logits
ason      -0.71
Ezek      -0.70
everal    -0.69
ename     -0.66
elfth     -0.66
rade      -0.65
rue       -0.65
mber      -0.65
aven      -0.63
oubted    -0.63
Positive Logits
except    0.99
minus     0.84
Except    0.84
wherein   0.79
minus     0.74
albeit    0.74
whereby   0.71
insofar   0.69
kinda     0.66
ombies    0.65
Activations Density: 0.464%