INDEX
Explanations
negative statements or negations
negations and the phrase "isn't" in various contexts
New Auto-Interp
Negative Logits
tein
-0.74
gnu
-0.66
creations
-0.60
Properties
-0.58
endeavors
-0.58
Supported
-0.57
doms
-0.55
WARN
-0.54
pursuits
-0.54
ritch
-0.53
POSITIVE LOGITS
hin
1.01
ibaba
1.00
anybody
1.00
enough
0.99
unanim
0.98
anyone
0.98
anything
0.97
any
0.96
enough
0.93
room
0.90
Activations Density 0.079%