INDEX
Explanations
phrases indicating singularity or exclusivity
instances of the word "only."
New Auto-Interp
Negative Logits
gnu
-0.64
insula
-0.63
communication
-0.62
actionDate
-0.62
volt
-0.61
metic
-0.60
charism
-0.60
mantle
-0.58
dict
-0.57
aptic
-0.57
POSITIVE LOGITS
marginally
0.82
onso
0.81
ices
0.80
incidentally
0.71
thia
0.69
ICES
0.69
kidding
0.68
accepts
0.66
lasts
0.66
phies
0.66
Activations Density 0.042%