INDEX
Explanations
words with a prefix or suffix indicating an action or change
occurrences of prefixes or beginnings of words that indicate actions or feelings
Negative Logits
Token               Logit
consequential       -0.80
guiActiveUnfocused  -0.76
generic             -0.69
secondary           -0.67
differential        -0.67
attribution         -0.66
hostage             -0.65
sonian              -0.65
inclusive           -0.65
mandatory           -0.64
Positive Logits
Token               Logit
wrote               1.23
izes                1.09
ook                 1.07
itates              1.07
uld                 1.05
igrated             1.01
ifies               0.99
ently               0.97
cedes               0.97
saw                 0.96
Activations Density: 0.242%