INDEX
Explanations
the prefix "un-" followed by an action or state in the text
words or prefixes related to the concept of "un-," indicating negation or reversal
New Auto-Interp
Negative Logits
watered
-0.68
unpop
-0.63
recurring
-0.63
bedrooms
-0.63
Simpson
-0.62
Tut
-0.61
untold
-0.61
chase
-0.60
unexplained
-0.59
home
-0.59
POSITIVE LOGITS
rave
1.26
plug
1.17
loading
1.11
cles
1.11
offic
1.10
load
1.10
apolog
1.09
zip
1.05
ifies
1.04
seat
1.04
Activations Density 0.022%